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PREFACE 


The growing importance of the statistical method in physics is 
amply attested by the recent publication of several elaborate treatises 
emphasizing in their titles the words statistical mechanics. There 
have also lately appeared a number of other books containing detailed 
application of statistics to the properties of matter in the solid, liquid, 
and gaseous states. Most of these, though very useful to the special- 
ist, are likely to appear rather formidable to the student who i9 just 
embarking on graduate study and who wishes a thorough but not too 
lengthy introduction to the method of statistical physics. The author 
has tried to provide this in the present work, which is intended for 
readers equipped with an introductory background in theoretical 
physics. 

In this book the attempt has been made to survey as thoroughly 
as possible the various ways in which statistical reasoning has been 
used in physics from the classical applications to fluctuation phe- 
nomena, kinetic theory, and statistical mechanics to the contemporary 
quantum mechanical statistics. Emphasis has been laid on meth- 
odology. The author has taken the point of view that the greater 
the number of vantage points from which the subject is examined the 
deeper will be the student’s understanding. For this reason no 
attempt has been made to provide a strictly unified treatment which 
would appear to be more logical to many. At the same time, how- 
ever, particular effort has been exercised to relate the various statisti- 
cal methods in order that the reader will see their similarities as well 
as their differences. A glance at the table of contents will show that 
specific applications have not been neglected, and there are numerous 
problems to test the reader’s grasp of the subject. 

It is appropriate for the author to acknowledge in the preface his 
indebtedness to those who have given help or encouragement. Such 
acknowledgment is usually confined to professional colleagues or the 
writings of the masters. Too seldom is attention paid to the con- 
tribution of the author’s students, whose thorough and patient study 
of the various stages of organization of the book leads to gradual 
improvement in the correctness and clarity of presentation. The 
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author feels that he owes a definite debt of gratitude to the graduate 
students at Brown University who during the past few years have 
taken the course on which the book has been based. Particular 
acknowledgment is due to Mr. J. A. Rich for help with the diagrams 
and the proof. 



CONTENTS 


CHAPTER I 

Dynamical and Statistical Theories 

SECTION PAGE 

1 The Method of Physics 1 

2 The Nature of a Dynamical Theory . . .... 2 

3 The Nature of a Statistical Theory . 4 

4 Non-Mechanical Statistics g 

CHAPTER II 

Elementary Probability and Statistics 

1 A Simple Problem in Physical Statistics 11 

2 Elementary Probability . . 12 

3 Calculation of Averages . , 13 

4 Laplace's Formula . . ... ... 16 

5 Evaluation of the Coefficient in Laplace’s Formula. Stirling’s Formula.. 18 

6 Use of the Normal Distribution Law in the Calculation of Averages .... 21 

7 Unequal a prior* Probabilities . . 23 

8 Generalized Laplace’s Formula . . 25 

9 Volume Distribution of Gas Molecules. Fluctuations .... . . 26 

10 The Shot Effect as a Fluctuation Phenomenon 29 

11 Radioactive Emission as a Fluctuation Phenomenon 31 

12 Poisson’s Formula . 32 

13 The Theory of Errors 33 

CHAPTER III 
Review of Thermodynamics 

1 Fundamental Concepts . . 41 

2 The Two Laws of Thermodynamics 42 

3 Free Energy and Other Thermodynamic Functions 45 

4 Some Thermodynamic Relations ... 46 

5 R£sum6 47 

CHAPTER IV 

Classical Maxwbll-Boltzmann Statistics 

1 Statistical Distribution of N Objects in n Groups 50 

2 The Canonical Distribution .... .53 

3 Properties of the Canonical Distribution. Interpretation of Thermody- 

namic Relations 59 

vii 



CONTENTS 


viii 

SECTION PACE 

4 Distribution or Partition Function for a System of Free Particles. Free 

Energy of an Ideal Gas . 65 

5 Equipartition of Energy in a System of Free Particles. Entropy of an 

Ideal Gas 67 

CHAPTER V 

The Kinetic Theory of Gases 

1 The Virial and the Equation of State of an Ideal Gas 70 

2 Some Simple Kinetic Theory Properties of an Ideal Gas 74 

3 Collisions and Mean Free Path of a Gas . . . 78 

4 Elementary Kinetic Theory of Viscosity . 81 

5 The Maxwellian Distribution of Velocities 83 

6 Application of the Maxwell Distribution to Collisions. 86 

7 Transport Phenomena .... .89 

8 The Boltzmann 77-theorem . . 92 

9 Equation of State of a Real Gas. Virial for Interaction Forces. . 94 

10 The Brownian Motion and Molecular Reality 98 

CHAPTER VI 

Classical Statistical Mechanics 

1 The Characteristics of Statistical Mechanics 102 

2 Review of Advanced Mechanics . 103 

3 Phase Space 108 

4 The Gibbsian Ensemble . 110 

5 Liouville's Theorem. The Equation of Continuity in Statistical Mechanics 114 

6 The Fundamental Postulate of Statistical Mechanics ... 118 

7 The Microcanonical or Energy-Shell Ensemble 119 

8 Component Systems . . 124 

9 The Canonical Ensemble .... . . . 126 

10 The Maxwell-Boltzmann Distribution Law . . . 131 

11 Deviations of Quantities from Their Average Values .... 134 

12 The Statistical Mechanical Interpretation of Temperature 139 

13 The Statistical Mechanical Interpretation of Entropy . ... 141 

14 Free Energy and the Gibbs Phase Integral 146 

15 Alternative Interpretation of Entropy 148 

16 R6sum6 151 

CHAPTER VII 

Statistical Mechanics by the Method of 
Darwin and Fowler 

1 Fundamental Concepts . . . . 154 

2 Evaluation of Averages 157 

3 Assembly of Two Kinds of Systems 164 

4 Statistics of an Ideal Gas 170 

5 The Concept of Entropy .... 173 

6 The Partition Function and Gibbs's Phase Integral 180 



CONTENTS 


ix 


CHAPTER VIII 


Fundamentals of Quantum Statistics 

SECTION PAGE 

1 Review of Quantum Mechanics 183 

2 Distributions in Quantum Statistics ... 188 

3 Alternative Treatment of Quantum Statistical Distributions . 194 

4 Evaluation of »/ for an Assembly of Free Particles 200 

5 Quantum Statistics of a Weakly Degenerate Gas . 204 

6 Weakly Degenerate Gas by the Darwin-Fowler Method 209 

7 Fermi-Dirac Statistics of a Strongly Degenerate Gas 213 

8 Fermi-Dirac Statistics of a Degenerate Electron Gas 218 

9 Bose-Emstein Statistics of a Strongly Degenerate Gas 220 

10 Statistics of a Photon Gas. Radiation Law. 222 

11 Application of Quantum Statistics to Atomic Structure . . 226 


CHAPTER IX 

Specific Heats of Gases and Solids 

1 Specific Heats of an Ideal Monatomic Gag . 230 

2 The Ideal Diatomic Gas . ... . . 231 

3 Specific Heat of a Crystalline Solid 236 

CHAPTER X 

Quantum Statistical Theory of Electrical and Thermal 
Properties of Metals 


1 The Lorentz-Sommerfeld Theory of Electrical and Thermal Conduction 244 

2 Heat Production in a Conductor. The Thomson Effect 257 

3 The Peltier Effect 260 

4 Thermoelectric Electromotive Force. The Seebeck Effect 261 

5 Transverse Galvanomagnetic and Thermomagnetic Effects 262 

6 Motion of Electrons in a Crystal Lattice Simple Treatment 263 

7 Application of Statistics to Magnetism . 272 


CHAPTER XI 

Emission of Electrons from Surfaces 


1 Simple Statistical Theory of Thermionic Emission.. . . . .. 277 

2 Transmission of Electrons through a Potential Barrier and Application to 

Thermionic Emission . 280 

3 Transmission of Electrons through a Potential Hill and Application to 

Thermionic Emission 285 

4 Effect of Strong Fields on Emission 289 

5 The Photoelectric Effect . 292 

6 Contact Potential ■ . . . 298 

Index 301 




CHAPTER I 


DYNAMICAL AND STATISTICAL THEORIES 

1. THE METHOD OF PHYSICS 

Physics is an attempt to describe a certain portion of human 
experience. From the totality of sense perceptions, the physicist 
abstracts certain ones for special study and by observation and ex- 
periment develops a body of propositions called physical facts. He 
then proceeds to construct on the basis of these facts certain concepts 
which are defined in terms of the more primitive ideas. Such, for 
example, are the kinematical concepts of mechanics: displacement, 
velocity, acceleration. , These concepts are expressed in symbolic 
form so that they are amenable to the usual processes of mathematical 
manipulation. Next, with experience as a guide and usually also 
with liberal use of the imagination, certain relations, appropriate to 
a given set of phenomena, are postulated among the concept symbols. 
These with the concepts themselves form the hypotheses of the phys- 
ical theory which is supposed to be the ultimate physical description 
of the phenomena. From these postulated relations, usually in the 
form of differential equations, can be derived by mathematical anal- 
ysis the laboratory equations which are called physical laws. 

Physical laws are equations containing quantities which have a 
direct operational significance in the laboratory and to which numbers 
can be assigned by experiment. Hence these laws are susceptible of 
experimental test. If they meet this test successfully the theory 
from which they have been deduced is to that extent successful and 
a valuable element of physical description. This does not mean, 
however, that the theory is thereby proved to be true. In the first 
place there may be quite another theory operating with different 
concepts and different hypotheses which yields the same laws; in 
the second place the attempt to extend the theory to include just 
one additional bit of experience may fail of verification. It is much 
safer therefore to say of a physical theory that it is successful or un- 
successful: the successful theory is one which not only implies laws 
agreeing with already known experience but also predicts laws for 
experiments that have not yet been tried and which leads to complete 
verification on experiment. Such a theory adds materially to human 

1 
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knowledge, since it suggests experience which presumably no one 
had previously taken the trouble to acquire. 1 

It is clear that different kinds of physical theories can be con- 
structed. The above remarks suggest that in any comparison of 
these we should then consider the following elements: (1) the fun- 
damental concepts, (2) the postulated relations or basic hypotheses, 
and (3) the types of physical laws resulting from (2). On this basis 
we shall now try to compare two different theoretical structures of 
importance, viz., the dynamical and the statistical. 

2. THE NATURE OF A DYNAMICAL THEORY 

The theory of dynamics or mechanics, as it is often called, has 
had remarkable success in describing a wide variety of natural phe- 
nomena. It is the oldest of physical theories and its concepts, hy- 
potheses, and laws have been so thoroughly studied that they have 
acquired an air of familiarity not shared by those of any other theory. 
This does not mean that the structure of dynamical theory is less 
abstract than that of electromagnetic theory, for example, but merely 
that we have got so used to thinking in terms of mechanics that we 
no longer feel the abstractness. Its essential aim is to describe first, 
all the observed motions of bodies and second, the physical phe- 
nomena in which motion is not actually observed, in terms of the 
motions of invisible bodies. The fundamental dynamical concept 
is the material particle 8 which is assumed to have position without 
extension, the property of inertia whose measure is mass, and certain 
relations with respect to other particles, e g., gravitation. The con- 
cepts of displacement, velocity, and acceleration of a material particle 
are constructed using the primitive notions of space and time. Cer- 
tain hypotheses are then introduced, forming the content of what 
are commonly known as the “laws” of motion, e.g., P = ma. These 
might better be called the “principles" of mechanical theory, since 
from them by suitable manipulation the laws, i.e., the laboratory 
equations containing time and distance, etc., which describe the 
actual motions of particles, can be derived. , 

Now classical mechanics, constructed in this way, has been very 

‘This statement of the method of physics is a highly compressed treatment 
which naturally does not pretend to do justice to the profundity of the theme. For 
a more elaborate statement the reader is referred to Lindsay and Margenau, ‘‘Foun- 
dations of Physics," Chapter I, John Wiley & Sons, New York, 1936. 

* Cf , op. at.. Chapter III lor an extensive exposition of the fundamentals of 
mechanics which are here given only in abbreviated form. 
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successful in dealing with the motion of a single particle in a fixed 
reference system, of two particles moving subject to their mutual 
interaction (a much more practical situation, of course, than the 
highly idealized isolated particle), and of a large number of particles 
which are rigidly connected so that they exhibit no relative motion 
(rigid dynamics). Classical mechanics has also proved itself adequate 
to describe the motion of ideally continuous aggregates of particles 
such as fluids (hydrodynamics). But it is important to recognize 
that the general problem of the motion of a large number of discrete 
particles subject to the action of arbitrary forces cannot be com- 
pletely solved by classical dynamics. It is true that much can be 
learned about the motion from the well-known deductions from 
dynamical principles: the laws of the conservation of mechanical 
energy and momentum. The complete solution would imply that, 
given the initial conditions (the initial position and initial velocity 
of every particle) the position and velocity of every particle are de- 
terminable for all time. Even if the solution of the problem could 
be given it would be useless for a very large number of particles, be- 
cause of our inability to assign the initial conditions. Suppose, for 
example, that we are trying to describe the behavior of a gas on the 
assumption that it consists of a very large number of tiny particles 
which move under the action of their mutual forces in accordance 
with the principles of mechanics. It turns out that the number of 
particles (molecules) per cubic centimeter of the gas has to be so large 
that it is completely hopeless to try to specify position and velocity 
for all of them at any one instant. It then appears that the dynamical 
method suffers here a serious check. 

In a situation of this kind the physicist does not give up in despair. 
He looks around for possible ways out. Two at least would appear 
to be available. In the first place one could decide to replace the large 
number of discrete particles by an ideal continuum and apply me- 
chanical principles to this. Unfortunately it appears that while, in 
following this course, we are able to describe certain properties of 
the gas, there are other very important ones which elude this mode 
of attack. Therefore we fix our attention on the alternative pro- 
cedure, which is simply to forego exact information about individual 
particles and to allow our questions to concern merely the average 
number of particles which have a certain range of properties at a 
given instant. When we do this we are departing from strict dynami- 
cal theory and are introducing a method which has received the name 
statistical. 
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3. THE NATURE 07 A STATISTICAL THEORY 

We have now to try to make clear how a statistical theory differs 
basically from a dynamical theory in physics. To some persons this 
may well appear a futile task, for it will be asked, how can one under- 
stand a new theoretical point of view until he has studied all its de- 
tails? The answer is that unless he is given some idea about it and 
its relations to things otherwise familiar he may not wish to study 
it at all; at any rate he will not study it with the same appreciation. 
People have a certain curiosity to know something about what they 
are getting into when they begin to learn a new discipline. Without 
laying claim to any particular profundity, in this section we shall 
go a little way toward satisfying this curiosity. 

At first thought it might seem that the difference between a statis- 
tical theory and a dynamical theory is not so very great, at any 
rate in the case we have above described, viz., the motion of a very 
large number of particles. The fundamental concepts used are those 
of classical mechanics, i.e., velocity, acceleration, force, mass, energy, 
etc. Moreover, the principles of mechanics are employed as funda- 
mental assumptions, but the kinds of laws derived are not those of 
classical dynamical systems, and we have to look closely to find 
just where the difference lies. In the first place the statistical laws tell 
us nothing about any particular individual particle; they are entirely 
concerned with numbers of particles which have certain properties, 
e.g., position in space, velocity, momentum, energy, at a given in- 
stant. Moreover they do not even pretend to tell us the precise 
number of particles in any case, but only an average number, from 
which there will in general be fluctuations. The idea of an average 
quantity is not foreign to classical dynamics, where we often find it 
useful to speak about the average velocity of a particle, or the average 
kinetic energy of a simple harmonic oscillator, for example. In clas- 
sical dynamics, however, we never speak of the average number of 
particles in a given state; this is a definite criterion distinguishing 
between the two modes of description, even though the same funda- 
mental concepts occur in both. 

Moreover, even though both classical dynamics and statistical 
physics employ averages their significance is quite different; when 
in dynamics we employ averages over time it is merely a matter of 
convenience and not because we cannot compute the precise values 
of the quantities in question at any instant. One of the fundamental 
characteristics of mechanical laws is that once the boundary and 
initial conditions have been inserted, they predict precisely the values 
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of all the variables involved at any instant. Dynamics implies abso- 
lute determinism for any physical system. This is well illustrated 
in celestial mechanics, wherein from a few observations the path of 
a planet or even a comet can be computed with great accuracy. Now 
things are quite otherwise in a statistical theory. Here the average 
is employed because there is no possibility within reason of predicting 
an exact value. Hence by the very use of such averages, we forego 
precise determinism. It must be clearly understood that in this re- 
nunciation we do not deny that there may be determinism in the 
system which we are studying, e g., in a gas, the number of molecules 
in a given element of volume may actually be uniquely determined 
as a function of the time. However, the effort to follow through the 
precise variation with time may be so great as to become unreason- 
able; it may defeat its own ends by rendering physical description 
too complicated to be worth while. Rather than be balked by this 
unhappy situation we decide to get along with average numbers over 
appropriate periods of time. We do not worry if the actual number 
at any instant within such a volume interval differs from the average, 
as long as experimental observations indicate that this difference 
does not become too large or persist for too long a time. 

The reader who has followed the above paragraph closely will 
undoubtedly be inclined to ask why the statistical theory is effectively 
any less deterministic than the dynamical theory if we can calculate 
average numbers precisely and get agreement with experiment. Is 
not that all that can be expected of a theory? The answer to this 
question evidently involves the way in which the averages are calcu- 
lated. As we have emphasized, when we compute the average of a 
quantity in classical mechanics, we know the value at every instant 
of time or every point in space as the case may be; the calculation 
of the average is a mere matter of convenience. In the problems 
treated by the statistical method we do not know the instantaneous 
values of the important quantities, and so the question at once arises: 
how do we propose to calculate their averages? To answer this 
query fully is indeed the function of a book on physical statistics. 
However, we can at least say here that the calculation of statistical 
averages is based fundamentally on the concept of probability, a 
concept which seems to have by no means so clear a meaning as the 
concepts of mechanics, for example, but of which nevertheless we 
all feel we have an intuitive grasp. This notion is foreign to dynamical 
theory, though it enters into every element of experience including 
the most precise of physical experiments, where it is reflected in our 
treatment of the variations in the successive measured values of the 
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same quantity, i.e., the theory of errors. To this extent dynamical 
theory is a highly idealized model since it neglects a very significant 
feature of experience. On the other hand the human urge to believe 
in causality has hitherto placed a high premium on the value of a 
deterministic theory. This is a good illustration of the competition 
in the construction of concepts which is constantly at work in the 
human mind in its attempts to describe experience. There are ele- 
ments in experience which make us feel that determinism is the correct 
point of view in physics and incline us toward dynamics; there is 
also much experience which emphasizes the importance of chance. 
This inclines us toward the type of theory which openly employs 
chance; statistics is a theory of this kind. 

There is still another difference between dynamical and statistical 
theories. This is the distinction between reversible and irreversible 
processes. Let us consider a very simple example, namely a particle 
which moves with constant acceleration a along the x axis. We 
assume the initial conditions * = 0, & = 0 at * =* 0 (Note: the dot 
notation is used to indicate differentiation with respect to the time). 
Suppose the particle reaches the position * = xi at time t — t\, where 
*1 *■ 0*i/2, in the usual way. Now Buppose that at / => h the 
existent velocity ij = ati is reversed. What happens? We now 
have the new initial conditions x = x u x — — ati, whence at the 
end of a second time interval of magnitude t\, the distance of the 
particle from the origin becomes x — xi — o*j -f o*i/2. But from 
*1 = atf/2 it follows that x = 0. Also the final velocity at the end 
of the second time interval is x = — ati + 0*1 = 0. This means 
that the motion of the particle has completely reversed itself. By 
the mere reversal of the velocity we have brought the particle from 
the state it had reached at time h back to the original state from which 
it started. We could have achieved the same result by changing the 
sign of the time t — t\ and allowing the t parameter to go from — 1\ 
to zero. It is seen that this is mathematically equivalent to reversing 
the velocity. In any case we have here an example of a reversible 
motion or process, i.e., one which by suitable manipulation of the 
parameters can be made to reverse itself and proceed back through 
all its successive previous states to the initial state, without at the 
same time changing the state of any other physical system. It is 
a comparatively simple matter to prove* that strictly dynamical sys- 
tems undergo reversible processes only. 

Now let us consider a somewhat different illustration. We sup- 
pose that the particle just discussed moves along the x axis under 

* For a short demonstration see op. c it., p. 195. 
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the action of a constant force, but that in addition its motion is re- 
sisted by a force varying directly as its velocity, or in general of such 
a character that it always opposes motion; in short, we suppose that 
a frictional force acts on the particle. If now we carry out the same 
sort of process indicated above, it will be found that the motion does 
not reverse itself as before; the particle does not return to its original 
position and velocity after time ti if the velocity is reversed at t ti 
when x = Xj. In fact a simple calculation (left for the reader to per- 
form) shows that the particle always falls short of reaching its initial 
position under these conditions. This is an illustration of an irrever- 
sible process, or one which can not be annulled simply by reversing 
some of the parameters of the system in question without disturbing 
or changing in any way the environment. Thus, in the example just 
described the only way to get the particle back to its initial state in 
the time interval h after the reversal of the velocity atx — xi at time 
t — h, will be for some outside influence to compensate for the dis- 
sipative effect of the friction. 

The distinction between reversible and irreversible processes is 
of fundamental theoretical significance. Owing to the prevalence 
of frictional forces, it is clear that irreversible processes are actually 
the rule in nature. The question then arises: Why do we use the 
concept of reversible process at all? The answer is that this type of 
process is associated with the dynamical method of description as 
the above illustration (first case) has just showed. However, the 
further question will immediately be asked: Did we not effectively 
give a dynamical description of the irreversible process also and does 
this not mean that both reversible and irreversible processes can be 
described by dynamics? It will be noticed, however, that the solu- 
tion of the second problem was rather artificial since we assumed a 
frictional force proportional to the velocity without in any way seek- 
ing to understand the nature of the frictional force more closely. 
Hence although it is true that classical dynamics can handle some 
kinds of irreversible processes, it is only in a rather formal way and 
difficulties are encountered as soon as a more thoroughgoing treatment 
is contemplated. We can put the matter thus: if we leave out of 
dynamics forces varying as some odd power of the velocity, classical 
dynamics describes only reversible processes. By formal generali- 
zation one can make dynamics describe certain irreversible processes, 
namely, those in which small frictional dissipation enters. The essen- 
tial reason for this is that, although friction is always present in 
natural motions, by suitable manipulation we can make it so small 
as to have a negligible effect over a period of time which is of interest 
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to us. Thus consider as an example a simple pendulum swinging in 
air. The simple dynamical theory of the motion describes it as 
periodic, so that once started the pendulum swings indefinitely with 
a characteristic frequency and the same amplitude with which it 
started. Actually it is not observed to do this ; each swing is a bit 
smaller than the previous one and the motion gradually comes to 
rest. If, however, the motion takes place in a region from which 
some of the air has been removed, the dissipative effect is observed 
to be much less and the pendulum takes a much longer time to come 
to rest. We therefore feel that in the limit of no frictional resistance 
the motion would be completely periodic and hence reversible. Actu- 
ally even in motion in air if we are content to restrict ourselves to a 
time interval which does not exceed too many periods of the motion, 
the dissipation can be neglected and the dynamical treatment is for 
many purposes satisfactory. It comes down to this: for the sake of 
what we call simplicity we use the dynamical theory with its con- 
comitant reversibility when it leads to approximately correct agreement 
with experiment, i.e., in which the irreversibility can be neglected 
in the ideal limit. 

Now there are phenomena in nature which appear to be so funda- 
mentally irreversible that in no ideal limit can we consider them as 
reversible. Such is the flow of heat. It is an experimental fact that 
heat is always observed to flow from a body of higher temperature 
to one of lower temperature as long as no outside influence is imposed. 
If this process were reversible it should be possible, without in any 
way disturbing the state of other bodies but merely by reversing the 
sign of some parameter connected with the flow, to make the flow 
proceed in the other direction. Actually there appears no way of 
doing this: to make heat flow from a low temperature to a higher 
temperature requires external work (as in a refrigerator). Hence the 
process is not reversible. 

Is there anything inherent in the statistical point of view which 
renders it particularly fitted for the description of systems under- 
going irreversible processes? The answer is yes, for we have seen 
that the method of statistics uses average distributions of particles 
with respect to certain properties. It calculates these averages by 
the use of probabilities and the natural assumption is that those dis- 
tributions will actually be realized which have the highest proba- 
bility. Hence there will be a tendency for distributions to change 
in the direction of increasing probability. This change is evidently 
a one-way process and by nature irreversible. To be sure, there is 
involved the curious circumstance that, since we have only proba- 
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bility considerations to guide us, there will always be a finite proba- 
bility of any process going either way, e.g , in the conduction of heat, 
we cannot rule out as impossible the uncompensated flow of heat from 
the cooler to the wanner body. All we are allowed to say is that the 
flow in the other direction is overwhelmingly more probable. 4 

4. NON-MECHANICAL STATISTICS 

In the previous section we were led to the concept of a statistical 
theory through a type of problem which actually employs the funda- 
mental concepts of mechanics but in which the method of dynamics 
is unable to give a complete solution. The type of statistical theory 
which uses mechanical concepts is very important for physics but it 
must be pointed out that it is by no means the only useful kind of 
application of statistics in physics. There are some physical phe- 
nomena in which the concepts of mechanics seem to play no role at 
all. An example is the phenomenon of radioactive decay which appears 
to be best described by saying that in a special, simple case of, disinte- 
gration the number of radioactive atoms disintegrating per unit time 
is directly proportional to the number present. No mechanism is 
provided to govern the disintegration and the treatment is purely 
statistical, leading to the well-known equation 

N = Noe~ u (1) 

giving the number of atoms undisintegrated after time t if N 0 is the 
original number present. It is to be observed that nothing remotely 
connects this formula with dynamics. It merely associates a number 
with the time and does so on a probability basis, i e , the number is 
an average. We may well expect fluctuations from it when the experi- 
ment which it describes is repeated again and again. We must make 
one more important observation: the number N must be a large 
number if the formula is to have much meaning. This, of course, is 
universally true of statistical formulas: they lose their significance 
if the numbers entering into them are too small. 

It is then clear that whether we begin with the assumption (a) that 
all physical phenomena are ultimately describable in terms of me- 
chanical concepts or (6) that some physical phenomena are not 
explicable in this way, we are led to the desirability of using statistical 
reasoning in physics. It is the plan of this book to indicate how this 
can be done and to give many illustrations of the actual process. 

* For further general discussion, op at , pp. 196 ff , may be consulted. 
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PROBLEMS AND QUESTIONS 

1. Prove that when a force varying directly with the velocity acts on a particle 
subject otherwise to a constant force, irreversible motion results. 

2. Make a list of irreversible processes in physics and indicate which of them 
can be replaced effectively by ideal reversible processes. Discuss some of these 
processes in detail. 

3. Prove by the use of Hamilton’s principle (cf. eq. 8 of Chapter VI) that all 
conservative dynamical systems undergo reversible processes only. 

4 . A particle falls under the influence of gravity through a medium which resists 
its motion by a force varying directly as the velocity. Show that the velocity of 
the particle approaches a limiting value and comment on the connection between 
this and the irreversibility of the motion. 

5. Set up the differential equation whose solution is eq (1) of this chapter. 
Give two physical interpretations of the constant X. From the fact that N and No 
in eq. (1) must be integers, what mathematical difficulty do you find associated with 
the formula? Should a physicist be greatly disturbed over this difficulty? Why? 



CHAPTER II 


ELEMENTARY PROBABILITY AND STATISTICS 

1. A SIMPLE PROBLEM IN PHYSICAL STATISTICS 

We begin our study with a definite physical problem which has 
some interest in itself and yet is simple enough to illustrate clearly 
the fundamental statistical methods we intend to develop. 

Consider a single material particle which is restricted to move along 
a straight line, say the x axis. Suppose that in time T it makes n dis- 
placements each of length l, where n » 1. These displacements can 
be either in the positive or negative * direction. In fact we shall 
assume that a positive displacement is just as likely or probable as a 
negative displacement. Let us further suppose that the number of 
positive displacements in time T is n j and the number of negative 
displacements « 2 , so that «i + «j = n. The distance of the particle 
from its starting point at time T is then 

L = l(m - n 2 ). (1) 

Now if we knew all the forces acting on the particle we should, of 
course, be able to calculate by the principles of mechanics the exact 
value of L. However, we are here assuming that we do not know the 
forces and therefore cannot use mechanics. The best we can do is to 
try to calculate an average value of L. To see the meaning of this, 
imagine that it is possible to carry out an experiment and observe 
L directly. Further suppose that the experiment can be repeated 
many times, very likely with differences in the values of L obtained. 
We could then find an average L by direct arithmetical means. 
However, life is too short to spend our time on such experiments. 
What we should like to do is to calculate ail average L with the hope 
that it will agree well enough with that experimentally observed to 
serve as the basis for further theoretical predictions about the behavior 
of the particle. 

As stated in Chapter I, in order to calculate an average when we 
do not know the detailed time-course of the phenomenon, we must 
have and use some probability values. This means that we must be 
able to compute the probability associated with each value of L. 

11 
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2. ELEMENTARY PROBABILITY 

It is not practicable in this book to explore all the problems and 
controversies associated with the definition of probability. It is 
obviously a concept difficult to define logically so as to be proof against 
all objections that can be brought against it. Fortunately for the 
present purpose we can be content with the simple "frequency” view- 
point, which is essentially that of v. Mises. 1 If we consider once 
more the n displacements of the particle (Sec. 1) in time T, an impor- 
tant concept is the total number of ways in which the » displacements 
can be divided into two groups, i.e., positive and negative, without 
any restriction on the number in each group. If there were only one 
displacement it could take place in either of two ways, viz., either 
positive or negative. If there were two displacements there would be 
four ways of performing them, as indicated in the following table 

First displacement + + — — 

Second displacement + — -+■ — 

For « displacements the number of ways in question is 2*. This is a 
mere matter of counting: there are two ways of performing the first 
displacement, two ways for the second, ■ • •, two ways for the nth. 
They are all assumed to be independent of each other, i e., the fact 
that the first happens to be positive or negative entails no restriction 
on any of the subsequent ones. Hence the total number of ways of 
grouping the displacements is 2 • 2 • 2 • • • to « factors or 2". Now of 
all these ways there will be a certain number corresponding to 
positive displacements and n$ negative displacements (where n = 
»i+ nf). This number is very readily obtained. We can choose the 
first positive displacement in n ways, the second in (n — 1) ways, 
the third in (» — 2) ways, • • • and finally the »ith in (« — ni+ 1) 
ways. Since these are all independent the total number of ways 
desired would appear to be »(« — 1) •••(» — wi+ 1). But many of 
these correspond to mere rearrangement of the «i positive displace- 
ments. We are not interested in the order in which the displacements 
occur but merely in their number and must therefore divide the 
above by the number of ways in which ti\ displacements can be re- 
arranged among themselves, namely, nj!. Therefore the total number 

1 R. v. Mises, “Wahrscheinlichkeitsrechnung,” Leipzig, 1931. For a brief re- 
view of this point of view, consult "Foundations of Physics," pp. 1S9 ff. 
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of ways of dividing the » displacements into «i positive and n 2 nega- 
tive without regard to order is 


/« \ _ n(n — 1) (» — n x + 1) 

\»i/ ~ », ! 


( 2 ) 


By multiplying numerator and denominator by n 2 \ we can bring t his 
into the more symmetrical form 



(20 


The situation then is this : Out of 2 n ways of grouping n displace- 
ments into the two classes, positive and negative, there are ^ ways 

for which n\ are positive and n — = n 2 are negative without regard 

for order. The relative frequency of occurrence of the » I , n 2 combina- 
tion is therefore given by the ratio ^/2", and we shall call this the 
probability ■P-, of the occurrence of such a combination. Thus 



n \ _ J_ n! 
,«i/ 2 n »! ! k 2 ! 


(3) 


In our problem this is the probability connected with the value 
L — l(n x— n 2 ) for the final displacement of the particle from its 
initial position after time T. 

The reader familiar with v. Mises’ notation will see that we are 
assuming that the total number of ways of grouping the n displace- 
ments into the two groups forms a so-called probability aggregate. It 
is unnecessary, however, to discuss the fundamental nature of such 
an aggregate. We note only that the quantity P ni in (3) is always a 
proper fraction and that 

n 

E p -. = i - < 3 4 > 

m-o 


3. CALCULATION OF AVERAGES 

We are now ready to use the results of Sec. 2 in the calculation 
of the average value of L. To obtain this we have merely to multiply 
the value of L corresponding to a particular iti,n 2 combination by 
the probability associated with this combination and to sum the 
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products so obtained over all values of n i, namely from 0 to n. If 
we denote the average of L by L, we have then 


(5) 


£ =2 - T' s( ”‘ “ ” s) (”,)• 

ni ■ 0 

The problem now is to evaluate (5). Expanding the sum gives 

2(«i - *,) (” x ) = 2 2»i (* ) - n-2\ (6) 

The computation of Sni ^ is based on the fact that the quantities 
^ are actually the coefficients in the binomial expansion. From 


elementary algebra 
(1 + *)» = 1 + 


(?)* + (”) x ’ + ••• + (“,) 


x ni + 


+ x n 


If we differentiate both sides of this identity with respect to x we get 
« (i + xr~ i (” ) (?') 

Since * is arbitrary we can set it equal to unity and have 

Sr (”.) - ■ 2 "‘ (8) 

which in connection with (5) and (6) at once leads to 

1 = 0. (9) 

This is a not unexpected result. It means simply that on the average 
the number of positive displacements n\ equals the number of negative 
displacements n 2 which is, of course, inherent in the initial assumption 
that a positive displacement is just as likely as a negative one, so 
that indeed we must expect «i = n/2. Mathematically speaking, 
this result is trivial. We have presented the analysis mainly because 
of its future utility. 

There will still be some interest in an average which gives some 
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information about the absolute value of L. Let us average 1? instead 
of L. Thus from (1) 


2 *> -»>’(”.) 





Differentiation of (7') with respect to x yields 

n{n - 1)(1 + a)"' 2 = - 1)(” ) * B, ~ 2 , 

whence, with * = 1 as before, 

n{n - 1 ) 2 B— 2 = 2 n\ ) - n- 2 n ~\ 


Then 

tF («i) = ^ 


( 10 ) 


The other terms in (10) are already known. Hence substitution yields 

L 2 = nl\ (11) 

The square root of L 2 , which we may call the root-mean-square value 
of L, will serve as a kind of average of L without regard to its sign. 
Thus 

Vl 2 = Vnl. (12) 


There is another interpretation of this result. Let us compute the 
mean square deviation of L from its average value. This is called in 
statistical parlance the dispersion and usually denoted by a 2 . We 
then get the important general relation 

<r 2 = {L - L) 2 = I 2 - 2L 2 + I 2 = I 2 - L\ (13) 

In the special case under consideration L = 0, and therefore o 2 = L 2 . 
Here the mean-square value of L is simply equal to the mean-square 
deviation of L from its average. The square root of a 2 , or root-mean- 
square deviation from the average, is usually known as the standard 
deviation. In our problem the standard deviation is the root-mean- 
square of L itself. 

The concepts of mean or average value, dispersion and standard 
deviation, are so important for statistical reasoning that we may well 
pause a little to discuss them further. The idea of average, indeed, 
has probably been discussed enough to be reasonably clear. If we 
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were to make several repetitions of the experiment of observing the 
particle after an interval T, we should expect that some of the dis- 
placements from the origin would be positive, others negative, but the 
average close to zero. We have no reason to expect the average to 
be exactly zero, but if we found on repeated trials that the average 
failed to stay close to zero, we should suspect that we were over- 
looking some important feature of the problem and that our statistical 
reasoning was not applicable or at any rate not applicable in the 
simple form here assumed. In the standard deviation we have an 
additional test. The successive tries will yield values of L different 
from L, i.e., we must expect deviations or fluctuations from the mean. 
However, we have just shown we can calculate an average value for 
these fluctuations, viz., -y/rtl in our problem. We should expect that 
the observed value of this deviation for a great many trials would 
not differ markedly from y/nl. If it did we might suspect that 
something was being neglected. 

Now in most applications of statistics in physics we do not and 
indeed cannot actually proceed to verify our fundamental assumptions 
quite as directly as all this. Rather we assume at the outset that 
the statistical method is the one appropriate for the problem and 
assign probabilities in what appears to be the most plausible manner. 
From these we calculate averages and what is more important, 
relations among these averages and parameters which by the nature 
of the case have fixed values. These relations are the laws (in the 
sense of Chapter I) which should describe the phenomena in question. 
We then proceed to identify the average values with the actual results 
of experiment and hope to find that the statistical laws we have 
derived really do provide an accurate description. 

The fluctuations mentioned above are not to be dismissed as 
unimportant, for at times the theory may predict rather large ones 
and then experiment should certainly reveal them if the theory is 
applicable. 

4. LAPLACE’S FORMULA 

The formula (3) for the probability of the occurrence of »i positive 
displacements with negative ones in a total of n = n\ + «a (usually 
called Newton’s formula) is not convenient for mathematical manipu- 
lation since it contains the important number »i and consequently L 
in the form of the factorial. There is advantage in expressing the 
probability explicitly as an analytic function of L. This has been 
done in an approximation formula associated with the name of 
Laplace. 
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Let us express J in terms of L. From L — (»i — na)l and 

«x + «a * » we have at once «i = (n + L/l )/ 2 and n a = (n — L/J)/2. 
For simplicity let L/2/ = x. Then we can write 


» \ «! n![(«/2)!] 2 

fit) (n/2 + *) ! (n/2 - x) ! (n/2 + x) ! (»/ 2 - x ) ! [(n/2) !]“* 


n! n/2 - (n/2 — 1) --(n/2-x-ft) 
[(n/2)!] 2 -(»/2 + x) • (n/2 + x - 1) - - - («/2 + 1)‘ 


(14) 


The form (14) assumes tacitly that x is positive. The form will change 
if x is negative but the reader can show that the same ultimate result 
is obtained. It is assumed, of course, that n is a very large number 
compared with x. Multiply both numerator and denominator by 
(2 /»)* and get 

(n\ n! (1 - 2/«)-(l - 4/»)- • (1 - (2s - 2 )/n) 

W [(»/2) !] 2 ' (1 + 2/ n) • (1 + 4/n) - • - (1 + 2 x/n) ' ' } 

Under the assumption just made (15) can be written to a very close 
approximation 

(n\ n\ e -l/" (2+4+8+ • +2x-2) 

\nj ~ [(n/2) !J 2 ‘ e 1/n (2+4+0+ +2 *> ' k ’ 


Making use of the arithmetical progression formula 2 + 4 + 6 + • • • + 
2x = x 2 + *. etc., we can finally write ^ in the form 



711 g — 2i*/n 

[(n/2) l] 2 


n 1 .-L>/2iU« 

[(n/2) l] 2 


( 17 ) 


in which L is now released from the bondage of the factorial and 
appears in explicit form in the exponential. Equation (17) is commonly 
termed Laplace’s formula. The function e~ ax ' has long been known as 
the Gauss probability or error function. It has the well-known form 

indicated in the accompanying Fig 2 1, where we have plotted 


CO 


as a function of x. 



accurately only 


Strictly speaking Laplace’s formula represents 
for x « n/2 and its value for larger values of x 
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might appear highly questionable. Fortunately we are usually inter- 
ested in such large values of it, that for large x the value of the ordinate 
is very small indeed compared with its value at * — 0, and the per- 
centage error involved in the use of the analytical formula in place of 

Newton’s factorial formula 
becomes very small. This 
enables us to use Laplace’s 
formula in the evaluation 
of averages, for which its 
form renders it particularly 
suitable. The error function 
is an even function, i.e., the 
value for given x is the 
same as for —x, expressing 
Fig 2 1. the fact that a given positive 

value of L is equally prob- 
able with —L. Moreover the maximum value occurs for x = 0 or 
L — 0, indicating maximum probability for the average value. 



5. EVALUATION OF THE COEFFICIENT IN LAPLACE’S FORMULA. 
STIRLING’S FORMULA 


Practical use of Laplace’s formula (17) can be made only if the 
factorial coefficient «!/l(»/2) 1 ] 2 is evaluated in terms of a simple 
function of n. This involves essentially the development of a formula 
for nl. Such a one is Stirling’s formula. There are several ways of 
deriving this presented in books on advanced calculus. We shall give 
a brief derivation here as an excuse for introducing the gamma function, 
which will be of considerable use to us later. Thus by definition for 
any real positive n and z (we are here contented with real variables) 


- A- 

Jo 


e ‘dz, 


whence a partial integration gives T(w) = (!/») / z n e ‘dz, so that 


Jo 


nr(n) = T(» + 1). But since T(l) = 1 by direct integration, we have 
for integral values of n 

r(w + 1) = »!, (19) 

so that 


«! = f e n e ‘dz. 
Jo 
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If we make the substitution z = n ■+■ V nu, taking advantage of the 
fact that for z = « or u = 0 the integrand in (19') attains its maximum 
value, 


n\ 


Vie-* e 


n log ( n+Vnu) — Vnu 


du. 


( 20 ) 


We can now write log (n + Vnu) = log n + log (1 + Vl /nu) and 
expa nd th e second term in a Maclaurin series, getting log (1 -f Vi/nu) 
* V^/nu — v? /2n to terms of the second order which are sufficient 
for our purpose since n is assumed to be farge. Substitution into (20) 
then yields 



Now the integral , 



du = 4>(ac) 


( 22 ) 


is known as the error function and plays an important role in the 
applications to follow. The particular case when x - 0 is important. 
We have * 

<f>(0) = VVi / e'^du = 1. (23) 

Jo 

Consequently (21) becomes 

»! = Vne~ n n n [V2ir - V*j2 HVn)]. (24) 


But as x increases indefinitely 4>(x) — * 0. Hence, since n is assumed to 
be large we are justified in neglecting V^/2 $(%/») compared with 
V 2ir. This yields the approximation 



The demonstration is not intended to be taken as mathematically 
rigorous. Stirling’s formula is, however, a very useful and surprisingly 
accurate one, with a very small percentage error even for n as small 
as 10. The absolute error grows with n, but this is of little conse- 
quence in most physical applications. 

We are now ready to apply the expression (25) to Laplace’s 
formula (17). The result is the approximate formula 

(26) 

* Cf., for example, F. S. Woods, "Advanced Calculus," p. 153, Ginn and Com- 
pany, 1934, or any similar book on the subject. 


» \ 2" +1 g _L»/2nP 

,»!/ V 2vn 
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This at once shows that ^ is a maximum for «i * • *»/2 or Z>« 

0 and decreases more or less rapidly as L increases. 

It is now desirable to associate a probability not with a single 
definite value of «j or L but a finite (though small) range of values 
of L. We shall agree to define the probability associated with the 
interval L to L + dL as 


Pi d 



2 

\/2irn 



(27) 


Since the probability is a pure number with no physical dimensions 
we use d(L/2l) instead of dL to denote the interval. Moreover 
L/ll = x measures the deviation of n x and n 2 from their mean values. 
In the expression (27) Pl appears as the probability associated with 
unit interval of L/% in the neighborhood of L/2L The form of (27) 
is governed by one other criterion: if we integrate both sides with 
respect to the independent variable L/21 and allow L to take on the 
limiting values — nl and +»/ we expect to obtain the value unity since 
it is certain that L will be somewhere in the interval from — nl to +nl. 
We thus demand that, as n — » «, 

£> 4 )- 1 ' < 28 > 

where the limits have been changed to those of x. From (22) and 
(23) there results 

* [1 “ (29) 

and since from the even character of the integrand, the integral in (28) 
is double that in (29), we conclude that as n becomes very great, (28) 
is true if Pi, is given by (27). 

Equation (27) is usually known as the normal or Gaussian distri* 
bution law. It has already been emphasized that it is a good repre- 
sentation of the Newtonian algebraic distribution law (3) only in the 
range where L/l < n. This does not limit its usefulness as much as 
might be supposed, since in the region where L is large, the large 
exponent in the exponential function makes Pl vanishingly small in 
any case. This fact renders the normal distribution law very useful 
in the calculation of averages. 
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6. USB OF THE NORMAL DISTRIBUTION LAW IN THE CALCULATION 

OF AVERAGES 

The advantage of the normal distribution (27) is that it corre- 
sponds to a continuous variation in L and therefore reduces the 
averaging process to a mere matter of integration. An objection may 
be made that the actual distribution is not continuous! for L makes 
only discrete steps. Nevertheless, if n is very large the difference 
from step to step becomes a very small fraction of the maximum L, 
and the variable L/21 = x can be considered continuous to a sufficiently 
good approximation. Indeed it is not to be expected that the normal 
distribution will apply to all statistical problems in physics. We shall 
later meet with some in which it is an extremely poor approximation. 
Nevertheless here, where the probabilities of positive and negative 
displacements are equal, if t«», the value of the distribution 
cannot be doubted. We shall see indeed that it gives the same values 
of x and (r 2 as were obtained above with the algebraic formula. 

Thus for the average x ( = L/21), the usual way of defining an 
average for a continuous distribution gives 

x — f xP L (x)dx = •• /*" xe _2l * /n dx => 0, (30) 

V2ir» J - «, 

where the result of the integration follows at once from the fact that 
xe~** /n is an odd function of x. The limits of the integration are 
from — °o to + 00 , whereas strictly speaking x is limited to the range 
— w/2 to +»/ 2. But if n is very large the introduction of the infinite 
limits involves no greater error than is already present in the choice 
of ?i(x). We shall hereafter consistently use the infinite limits for the 
sake of mathematical simplicity. 

The average value of the magnitude of x, i.e , | x | , is 

T*T - -4= r xe- 2 * /H dx = 4= = 0.399 (31) 

y/lrnJa \2v 

This gives 

\L\ = 0.798 Vnl. (31') 

In similar fashion 

? = - 4 = r '*•-*"* dx = 2 ■ < 32) 

V2j m J q 4 

This result is in agreement with the one obtained by purely algebraic 
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means in eq. (10). For L 2 = 4 l 2 x? = »/ a , leading to =» 1 V» or 

= 0.500 V». (33) 

which is also a measure of the standard deviation of the distribution aB 
a function of *. It is of interest to note that whereas the values of 
|* | and Vi? actually differ in value by about 20 per cent, neverthe- 
less they are of the same order of magnitude. If they had proved to be 
of quite different orders of magnitude, we might well have had our 
doubts about the value of the distribution law. It is natural to inquire 
about the averages of higher order, e.g., 'V / x*. We see at once that the 
odd power averages all vanish for the same reason that x — 0. The 
reader may show that for k even 

V** = (1-3 5 ••• (k - l)] 1 '*^- (34) 

The significance of this result is that all the averages are multiples 
of -y/n with coefficients slowly increasing with k. The root-mean- 
square average is, of course, the most useful one in statistical problems. 

There is, to be sure, another type of average somewhat different 
from those just considered. This is the arbitrarily termed “probable ” 
average. It is the value of *, usually denoted by x P , such that there 
are just as many cases in which * exceeds this in absolute value as 
there are cases in which x is less than this in absolute value. This is 
equivalent to saying that the ordinate for | xp | divides the area under 
the probability curve (Fig. 2 ■ 1) to the right of the origin into two 
equal parts. This area has however the numerical value Jij. Conse- 
quently 


By consulting the table of values of the probability function $(*), we 
find that this requirement can be uniquely satisfied by 

x P = 0.338 Vn, (35) 

approximately. We note the agreement in order of magnitude with 
| x | and V?. Placed in order, xp < | x | < V**. 

An important measure of the significance of these average values is 
found in the probability that the value of the deviation x shall not 
exceed them. The probability that * shall not exceed the value Xq 
in absolute magnitude will be written 

P(*o) = -4= r *~ 2 * /n dx. 

V2 rn J Q 



jr 


p 1 

-** /n dx = 

4 


e 


( 36 ) 
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For *o a “ X P> — }4 by definition. More generally, from the 

definition of the error function $(x) in eq. (22), 

ThusP(17f) = 1 - $(0,798) = 0.576, while P(Vj?) = 1 - $(1) = 
0.683. We can express this as follows : On the average there are 576 
chances in 1,000 that a deviation or fluctuation will turn up which is 
equal to or less than, i.e., not exceeding Tx]", while there are 683 chances 
in 1,000 that the deviation x shall not exceed Vi?. There are 500 
chances in 1,000 that * shall not be greater than xp. The probability 
rises with the magnitude of the average. 

7. UNEQUAL A PRIORI PROBABILITIES 

The physical problem so far treated in this chapter is a rather 
idealized one, though a somewhat more general case of it occurs 
in the so-called Brownian movement of small colloidal particles 
suspended in a liquid or a gas. This we shall discuss later in some 
detail. One of the fundamental idealizations in the problem is the 
assumption that a positive displacement of length / is just as likely as 
a negative displacement. There are few problems in physical statistics 
in which a simple assumption of this kind can be made with any success. 
Therefore, although the mathematical developments we have carried 
out are fundamental, they are not all immediately applicable to actual 
physical situations. It should indeed be mentioned that interesting 
illustrations of the formulas of the preceding sections may be found 
in the simple physical experiment of coin tossing. This can be made 
formally analogous to the physical problem of Sec. 1 merely by 
associating heads with a positive displacement of the particle and 
tails with a negative displacement. The problems at the end of this 
chapter show how well our theoretical expressions agree with experience 
of this kind. 

Now, however, a generalization is in order. We shall assume that 
the two types of displacement are not equally likely but that the 
a priori probability of a positive displacement is p / q times that of a 
negative displacement, where p and q are positive proper fractions 
with the property that p + q ™ 1. The term a priori demands 
careful consideration. Two possibilities are at hand : (a) we may have 
observed by direct experiment that the fraction p/q represents the 
relative frequency. In most physical problems this course is not open 
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and hence we must use the other possibility: (6) assume the ratio 
from the best information we have about the system in question. 

The problem of the preceding sections corresponds to p — q — 

We now desire to derive the more general formulas. Again consider n 
displacements divided into two groups, i.e., «j positive and »a nega- 
tive, with «i + «j = ». If the probability of a positive displacement 
is p, the probability of ti\ positive displacements independent of any 
negative displacements will be p" 1 . The negative displacements be- 
have similarly. Hence the probability of a combination of ni posi- 
tive displacements with n 2 negative displacements, if there were only 
one way of realizing it, would clearly be p n 'q nt . However, we have 


already seen that there are 



ways of realizing this distribution. 


Consequently the actual probability corresponding to this combination 
is 





(38) 


The calculation of the average displacement L in this case proceeds 
thus: 



(39) 


Since SP„, = 1 the second term in (39) reduces to nl. Now write 






(40) 


By differentiating both sides with respect to x, we get (after letting the 
parameter * = 1) 

np = g»i ("Jf’V 1 - (41) 

Therefore substitution into (39) gives 

L = n(p — q)l. (42) 

We could indeed have computed »i and » 2 directly and have found 
by the above method n{ = np and nj = nq, so that L = (»I — tT 2 )1 = 
n(p — q)l. We see that the general result (42) agrees with the special 
formula (9) when p = q = giving L ■= 0. However, for p 7 * q, 
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L 0. In particular, for p » q, we get approximately L — npl or 
practically nl. __ 

The calculation of the mean-square displacement L 2 follows in 
similar fashion with the result 

L 2 = nl 2 [Apq + n(p - q) 2 ]. (43) 

In the special case where p = q = this reduces to the previously 
obtained value (11). More interesting perhaps is the dispersion 
a 2 = 2? — Z 2 . This becomes 

a 2 = Anpql 2 . (44) 

The corresponding dispersion in x = L/21 is dearly 

4 - npq, (45) 

with the standard deviation 

Ox = Vnpq. (46) 


8. GENERALIZED LAPLACE’S FORMULA 


We now wish to find the more general Gaussian distribution 
formula for the case where p 9* g. For this we shall express P in 
terms of the deviation of tii from its average np. This deviation will 
be denoted by u, so that ti\ = np + u, and n 2 = nq — u. The 
problem is then to obtain an expression for (38), viz., 


P = 


n' 


{np -j~ «) ! {nq — u) \ 


p n p+ ,I q n i~ u i 


(47) 


in which the quantity « is freed from the bondage of the factorial. 
For the sake of variety we proceed somewhat differently than in Sec. 4. 
Taking the logarithm of P, we obtain 

log P = log n! — log {np + «)! — log(ng — «)! 

-f {np + u ) log p + {nq - u) log q. (48) 

Next we suppose that the factorials are all large enough so that we 
can apply Stirling’s formula in the form (cf. eq. 25). 

log nl = n log n — n -f ^ log 2xti. (49) 

We also take advantage of the expansion for the log (1 + x) where 
x<£ 1 (satisfied, e.g., by u/nq and u/np ), viz., 

log (1 + x) = x — I* 2 + • ■ • . (50) 

The expansion of (48) using (49) and (50) though a little lengthy is 
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perfectly straightforward and need not be set down here in full. After 
collection of terms we get 

. „ M 2 , . 1 . «(/> - g) , 1 « 2 . 1 «* 

"* F - - uk + 108 VSS + 4 + I i 5 ? • 

neglecting powers of « higher than the second. Examination discloses 
that the first term on the right is of greater order of magnitude than 
the third, fourth, and fifth. Consequently if we wish to keep only 
the highest order term in u, the result will be 



This is the generalized form of Laplace’s formula. It represents the 
normal or Gaussian distribution for the deviation of «i and n a from 
their average values, np and nq, respectively. For p = q — (51) 

naturally reduces to (27). It must be emphasized that (51) holds 
only for « small compared with np and nq. 


9. VOLUME DISTRIBUTION OF GAS MOLECULES. FLUCTUATIONS 

It will now be of interest to apply the analysis developed in the 
preceding sections to a somewhat more practical problem than that 
discussed in Sec. 1. We shall consider an ideal gas, which we shall 
assume to be composed of molecules in accordance with the elementary 
kinetic theory. In a volume V of gas, in a closed space containing N 
identical and indistinguishable molecules, the molecules will be moving 
about with varying velocities colliding frequently with each other 
and the walls of the containing vessel. If some one were to ask how 
many molecules there will be in a subvolume of 7, say V\, we should 
have to admit that in all probability the precise number will change 
from instant to instant and all we can hope to do is to assign an 
average value to the number Ni. How can this average be calculated ? 
Let us proceed in the simplest possible fashion by assigning the 
value Vi/V to the probability that a given molecule shall be in the 
subvolume Vi. This leads to certainty that the molecule shall lie in 
V, i.e., unit probability, while the. probability decreases as the size of 
Vi decreases relatively to V. Obviously there is no proof of this 
choice; it is only an assumption, but certainly a reasonable one. 
Similarly the probability that the molecule shall not be in Vi but in 
V 2 = V — Vi, is V 2 / 7. If we denote the probability Vi/7 by p 
and the probability Vi/Vby q (with p + q = 1) we should be able to 
Apply the considerations of Secs. 7 and 8. In particular the proba- 
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bility that N% molecules shall be in Vi and N 2 molecules in V a (with 
JVi + Nt ** N) will be simply 



NI 

N l \N 2 l 


p Nl q N \ 


(52) 


Applying Sec. 7, the average values of N x and JV 2 become 

Ni = Np, N a = Nq. (S3) 

Clearly there will be fluctuations from these average values. If 
neither Np or Nq is too small, i.e , if the subvolumes are not too 
small nor the gas too rare, we can immediately compute from Laplace’s 
formula the probability of a fluctuation u = Ni — Np. It is indeed 
given by (51) with N in place of n. This enables us to compute, for 
example, the probability that the fluctuation will equal or exceed a 
certain amount. Let us take an actual case, namely, V = 1 cm 8 , 
Fj = 10 -8 cm 3 , with N — 2.7 X 10 19 , the number of molecules per 
cnj 3 in an enclosed gas under standard conditions. Then p = 10~ 3 and 
Ni — 2 7 X 10 18 . Let us compute the probability that u/Ni shall 
equal or exceed 10~ 3 in absolute value, or 0.1 per cent. This will be 
given to close approximation by 


P = 


y/lirNpq Jio-*Ni 


f- 

JlQr*Ni 




with N - 2.7 X 10 19 , p = 10“ 3 , q m 1 - 10" 3 , N x 
To eval uate we change variables so that u a /2Npq ■■ 
vy/Npq, whence 


: 2.7 X 10 16 . 
1^/2 or# = 


■ ^ L 




t~*'H v = 4>(V2.7 X 10 10 ). 


For jc 1 , we have $(x) = s/2/r e I,/2 / x • (1 — 1/x 2 •••)• Conse- 
quently P here becomes approximately • 




-1 35X101° 


ir y/2.7 X 10 8 


This indicates that a fluctuation equal to or greater than the one 
indicated is very rare indeed. 

Another way of viewing the same problem is to calculate the stand- 
ard deviation of N\. From eq. (46) this is given by 

=» VNpq. 


9 


(54) 
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In the present example, <r = 1.64 X 10 8 , approximately. This is large 
in actual value but not in comparison with Of greater significance 
than a is the fractional or relative standard deviation, a/Nx. In the 
present case this is only 1.64 X 10 8 /2.7 X 10 16 10 -8 or 10~® 

per cent. The chance of finding such a small relative standard devia- 
tion by detecting density fluctuations experimentally would appear to 
be negligible. It might be supposed that if we could take a small 
enough subvolume the chance of detection would be greater. Thus if 
Vi = 10 -8 cm 8 , p = 10-®, JVj = 2.7 X 10 u ,<r~5 X 10° and <r/N x ~ 
2 X 10 -9 , which is some two hundred times larger than the former 
value though still small. As one looks around for possible means 
of detecting such a density fluctuation in such a small volume, one 
inevitably thinks of the effect of density change on the index of refrac- 
tion of light. 

The dependence of the index of refraction of a gas on the density 
may be written rather accurately in the form 

It - 1 + ap, (55) 


in which p is the density and a is a constant over a considerable range 
of variation. From this we immediately conclude that 


Am 


aAp 

1 + ap 


aAp, 


(56) 


since ap « 1 for gases. Thus for air the average p = 1.00029 approxi- 
mately for standard conditions, i e., ap 0 = 0.00029. Hence 


- = 0.00029 — - (57) 

M Po 

Suppose now we consider as our fundamental volume a cubic wave- 
length of the yellow light from sodium vapor, viz., that with an 
approximate wavelength of 6 X 10 -s cm. Then Vi ~ 2 X 10~ 18 cm 3 , 
Ni 5 X 10 s and f~2 X 10 3 with <r/N x ~ 5 x 10 -4 = Ap/po, from 
the definition of density. Consequently in this case 


A n 
M 


1.5 X HT 7 . 


In other words there should be a fluctuation of about one unit in the 
seventh decimal place of the average index of refraction of air for 
visible light. This is scarcely large enough for experimental detection. 
For light of shorter wavelength the fluctuation is somewhat increased 
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but is still very small. It has been shown by Smoluchowski * and 
others that for real gases near the critical temperature the above 
simple fluctuation theory (based on an ideal gas) is inadequate. 
Smoluchowski showed that for real gases the expression for <r/Ni 
given above is not correct. Actually analysis shows that the more 
nearly correct result is 


('W = 


RT 


dp 


N^ldp/dV]' 


For an ideal gas — = RT/V 2 and ( a/N j) 8 reduces to l/Ni or &* 


becomes N it which is the approximate result given in our work by (54). 


On the other hand for a real gas near the critical point where 


dp 

dV 


is 


— i i 

very small the value of {c/Ni) 3 can become very much larger than (54) 
would predict. Actually such gases at the critical point when illu- 
minated show an opalescence which has been attributed to the density 
fluctuation. 

The blue color of the sky has also been explained by density 
fluctuations like those considered here. We shall return to the prob- 
lem in a somewhat different form when we encounter the Brownian 
motion. 


10. THE SHOT EFFECT AS A FLUCTUATION PHENOMENON 

Another interesting illustration of fluctuation phenomena in physics 
is provided by the so-called shot effect. This explains the continual 
background of noise in a loud speaker actuated by the thermionic 
current in a vacuum tube in terms of the random emission of electrons 
from the cathode of the tube. This chance emission produces current 
fluctuations in the tube circuit. If we assume that the electron 
emission is completely unordered in the sense that the motion of each 
electron from the cathode is independent of that of any other, our 
statistical formulas should apply. 

Let us assume that over a very long time T the number of elec- 
trons emitted by the cathode is N, whence the expected average 
number per second is N/T. Actually the number N t in time intervals 
of the magnitude t (« T) will fluctuate from the expected average 
Nt/T, The fluctuation is given by 

u t = N t - y- (58) 

1 M. v. Smoluchowski, Ann. der phys 25, 205 (1908). 
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The foregoing theory then indicates that the standard deviation of 
the distribution is 


o 1 - yff (i (59) 


and the average relative fluctuation is 


<r 1 

Nt/T ~ VNt/T 


(60) 


B. Rajewsky 4 has been able to check this result by measuring the 
mean-square fluctuation in the tube -circuit current. Thus the average 
current over time T is 


h = 


§N 

T > 


while the actual current during any interval t is 


, eN t 

It = ~7‘ 

Hence the current fluctuation is 

a r r r «** 
A/ = ij — J 0 = — • 

The mean-square current fluctuation is 

J2 


IP = 


e 2 Nt 
* f 2 T 


lot 

t 


(61) 


In Rajewsky’s experiment he studied the emission of electrons 
from a special form of photocell constructed after the fashion of a 
Geiger-Mtiller counter. This allowed the counting of single electrons. 
In one particular case, for example, he counted 1,272 electrons in a 
period of 30 minutes. In our notation then T — 30 minutes and 
N = 1,272. This corresponds to an expected average of 42.4 elec- 
trons per minute. He observed the fluctuations from this figure over 
2-, 6-, 10-, and 20-minute intervals with results as follows 


t in Minutes 
2 
6 
10 
20 


i/Vjvi/r, obs. 
0 179 
0 061 
0 042 
0.019 


1/y/ Nt/T, CALC. 
0 109 
0 063 
0 048 
0.034 


The agreement in order of magnitude may be considered good enough 
* Phystk. Z. 32, 121 (1931). 
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to support the thesiB that the shot effect conies within the realm of 
our simple statistical theory. 

Allied to the fluctuation problem just considered is the fluctuation 
in electric charge which must continually take place in all bodies 
which according to the atomic theory of the structure of matter are 
assumed to consist of positively and negatively charged particles. 
Let us suppose that a given volume of matter consists of Ni particles 
with charge +e and N 2 particles with charge -e. If the material is 
electrically neutral we expect on the average that 2Vj = N 2 . How- 
ever, with large Ni and N 2 fluctuations are to be expected. Assume 
for simplicity that the charged particles are free and do not exert any 
influence on each other. The excess of positive over negative charge is 


S = e(Ni - N 2 ). 

The analogy between this and eq. (1) of this chapter should be suffi- 
ciently clear. If Ni + N a = N, the root-mean-square value of 8 will 
be (cf. eq. 12) ^ ^ 


which is a measure of the average fluctuation from complete electrical 
neutrality. The neglect of the electrical interaction between the 
charged particles makes this result too large and of questionable 
utility. Nevertheless it is conceivable that such fluctuations may 
some day be observed with sufficiently sensitive apparatus. , 


II. RADIOACTIVE EMISSION AS A FLUCTUATION PHENOMENON 
In Sec. 4, Chapter I, we commented on the phenomenon of radio- 
active decay as describable in statistical terms. If this is so, we ought 
to be able to apply the reasoning of Sec. 7 to the emission of a 
particles from radioactive substances. The following description of 
an early experiment by Rutherford will bring out the essential features. 

Using the scintillation method in which the flash produced by the 
impact of an a particle on a fluorescent screen is used to count the 
number of such particles emitted by a radioactive substance in a 
given interval of time, Rutherford in a certain experiment counted 
10,097 particles emitted over a period of 326 minutes. For conveni- 
ence he divided this period into 2,608 subintervals of minute each 
and noted the number of particles emitted in each submterval. The 
results of the count are given in the following table. 


Number of subintervals . 

57 

203 

383 

525 

532 

408 

273 

Number of scintillations 

0 

1 

2 

3 

4 

5 

6 

Number of subintervals. 

45 

27 

10 

4 

0 

1 

1 1 

Number of scintillations 

8 

9 

10 

11 

12 

13 

14 [ 
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This means that, e.g., there were 525 subintervals in which 3 scintil- 
lations were observed, while there were only 4 subintervals in which 
11 scintillations appeared. Now if this is really a statistical distribu- 
tion, the calculated standard deviation should agree with the observed 
standard deviation. The former is simply (cf. 54) 

a = s/Npq, 

where here N — 10,097 and N\ = Np = 10,097/2,608 *= 3.87 = aver- 
age number of scintillations per minute subinterval. Hence p — 
1 /2,608, q '"•< 1 and 

c = y/ 3.87. 


If now we compute the actually observed standard deviation, we have 


Cob. = 


57(3.87 - 0) 1 2 + 203(3.87 - l) 2 + • • • + 1(3.87 - 14) 2 


and get 


(Tobi = \/ 3.70. 


The agreement between «r oh , and a (about 5 per cent discrepancy) may 
be considered close enough to justify the use of statistical analysis in 
treating the problem* Of course this does not mean that we have 
a right automatically to use the Laplace formula or normal distribution 
law to describe the distribution in detail ; if the Laplace formula were 
to be fouhd not to apply, there would not necessarily be a contradic- 
tion. As a matter of fact, if we examine the assumptions on which the 
Laplace formula is based we see that it can be expected to hold only if 
neither Np nor Nq is too close to unity. Here, however, Np clearly 
violates this condition. Another type of approximation for Newton’s 
formula (38) is then clearly called for. 


12. POISSON’S FORMULA 

If in the expression (38) we substitute n\ — np = c, where c is 
a number of order unity we get 

'--c .)©■(- r 

This can be written in the form 

1 For more recent precision observations on radioactive decay, reference should 

be made to L. F. Curtiss, Bureau of Standards Journal of Research, 8, 339 (1932). 
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Now let it be assumed that »i « n. Hence each of the parenthetical 

expressions like (■ ~ — ^ is approximately unity and the same is 

true of (1 — c/n) ni . As we recall further that lim (1 - c/«)“ = e~ c 

n — * » 

and note that n\ is, of course, finite, we have in the limit as n — * oo 


P(»0 


e~V 1 

«il 


e~ np (np) ni 

Mi! 


(64) 


This is called Poisson's formula. It is applicable as long as m is 
small compared with n. When plotted the Poisson formula gives rise 
to a skew curve as distinct from the symmetrical normal curve of the 
Laplace distribution formula.® However as np gets larger the skew 
curve approaches the symmetrical one. The reader may show that 
the distribution expressed by (64) represents rather closely that 
observed in the radioactive emission experiment discussed in Sec. 11. 
For comparison the corresponding normal distribution should also be 
computed. 


13. THE THEORY OF ERRORS 

A review of the elementary applications of probability and statis- 
tics to physics would scarcely be complete without a reference to the 
theory of errors which is fundamental for the estimation of the validity 
of all physical measurements. 

In the performance of any quantitative experiment the aim is to 
secure maximum significance of the result by reducing to a minimum 
all extraneous disturbing influences. Thus in the experimental study 
of the relation between the pressure and volume of a gas at constant 
temperature it is essential that the temperature remain really con- 
stant throughout a whole series of observations of pressure and 
volume. This is a problem demanding precise experimental tech- 
nique. The great progress in accurate measurement has come from 
the development of such technique. When all precautions have been 
taken, however, it still remains true in every measurement that the 
apparently precise repetition of a particular operation under appar- 
ently identical conditions will rarely yield the same numerical result. 
What numerical value is then to be chosen to represent the quantity 
being measured? It is the task of the theory of errors to answer 
this question. 

•For a figure showing the relation between the Poisson formula and that of 
Gauss (Fig. 2 • 1), cf T C. Fry, “Probability and Its Engineering Uses," p. 239, 
D. Van Nostrand, 1928. 
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Let the quantity being measured be denoted by z and let a set of 
measurements of z under presumably carefully controlled experimental 
conditions be zj, z 2 , • • • z„. It is customary to ascribe the differences 
among the z’s to accidental errors as distinct from systematic errors 
which can be guarded against or accounted for by proper manipula- 
tion of the measuring apparatus. From the set of n values it is neces- 
sary somehow to produce a value which shall stand as the final “cor- 
rect” or acceptable one. It is most natural to assume that this will be- 
some kind of average of the z’s. The simplest type of average is the 
arithmetical mean, i.e., 


Zi + z 2 + • • • + z„ 

Z - • 

n 


(65) 


Let us for the moment adopt this as the acceptable value of the 
measured quantity. The quantities 

Ai = z, — 5 (66) 

then represent the fluctuations or deviations from the mean of the 
various measured values. It is an observed fact that if in any care- 
fully performed experiment we plot as ordinate the number of values 
as a function of the deviation from the mean, a frequency curve is 
obtained which, although it differs in detail for different experiments, 
nevertheless always possesses certain definite general characteristics. 
Strictly speaking, of course, the deviation from the mean is not a 
continuous set of values. What we do is to divide the total range of 
deviation into a set of equal intervals, i.e., from 0 to -fa, from -fa to 
-f 2a, etc. In the middle of each interval is plotted the number of 
measurements for which the deviation falls in this interval. If a 
smooth curve is passed through the resulting points as n is made 
sufficiently large, the result generally resembles the Gauss probability 
curve in Fig. 2 • 1 in the following respects : (a) there are many more 
values for which the magnitude of the deviation A is small than there 
are for which the magnitude of the deviation is large; ( b ) the number 
of values for any particular positive deviation interval tends to approxi- 
mate the number of values for the corresponding negative deviation 
interval. Thus there always tends to be a maximum in the curve 
near A *= 0 though, of course, there may also be subsidiary but lower 
maxima. 

If there were no accidental errors involved in the measurement we 
should expect the same value z to result from every observation. 
The differences among the z's may therefore be called errors and the 
frequency curve above described may be called an error curve in 
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which the ordinate gives the number of cases in which the error lies 
within a particular interval, say from A to A + dA. By division 
with n, the ordinate may further represent the probability of an error 
lying within the given interval. The “normal” or Gaussian error 
curve is then represented by the equation 

P(A)dA = ~ e~ h '*dA, (67) 

V v 

where P(A)dA is the probability that the error of an observation shall 
lie in the interval A, A + dA. The quantity h is called the “measure 
of precision.” Its value depends on the spread of the measurements. 

An important task of the theory of errors is to show under what 
conditions the expression (67) is justified. Many derivations of the 
normal law have been given based on a variety of fundamental 
hypotheses on the nature of accidental errors 1 We shall not repeat 
any of these here but shall only show the intimate connection between 
the normal law and the arithmetical mean. This will indeed involve 
the demonstration that the assumption that the most probable value 
of a measured quantity is the arithmetical mean, leads directly to the 
normal law. 

Let us suppose that the probability that the error in a measured 
quantity shall lie in the interval A, A + dA is f(A)dA, where /(A) is 
the error function whose form we are seeking. If the n observed 
values of the quantity in question are Zj, z 2 , • z n » respectively, and z 
is assumed to be the actual “correct” value, the errors in the various 
measurements are 

Aj = Zj — z, A 2 = z 2 — z, • • • A n = z B — z. (68) 

The probability that an error shall lie in an arbitrarily small but 
definitely assigned region in the neighborhood of A, is 

m a.), 

where K is a constant representing the size of the region. It is strictly 
the value of dA„ but we are practically agreeing to take all dA, ot the 
same size and call their magnitude K. The probability that in the set 
of n measurements we shall make the n errors A, will then be 

P = K n UJ(A t ). (69) 

ft 


» For one such derivation, cf. Lindsay and Margenau, “Foundations of PhyBics,” 
pp. 181 ff. 
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Since z has been assumed to be the “correct” value, the only meaning 
we can give to this statement is that P shall be a maximum for z. The 
condition for this is, however, 


From (68) we have 


1 d/(A,) dA, 

2-a /(A.) dA, dz 



for all s. 


(70) 

(71) 


Hence the maximizing condition is 

1 d/(A,) 

frf/(A.) dA,- 


(72) 


Let us now assume that z is the arithmetical mean. The condition 
(72) is then subject to the auxiliary condition 



A< = 0. 


(73) 


By the method of Lagrange’s multipliers (described and used in Sec. 2 
of Chapter IV) we can now express the conditions (72) and (73) in 
the form 



1 d/( A,) 

A, /(A,) dA, 


+ x)a, = 0, 


(74) 


where X is an undetermined multiplier. In order to satisfy this condi- 
tion we have to set, for all i 


d/(A,) 

dA, 


XA,/(A,). 


(75) 


The solution of this set of differential equations is 

/(A,) = Ce -XA,,/2 . (76) 


By employing the condition that P shall be a maximum and not a 
minimum it can be established that X is positive. Equation (76) is 
equivalent to the normal error law. It only remains to evaluate C. 
Since all errors must lie between — so and +oo, we must have 


/>+« /*+• 

/ /(A)dA = C 
*/— 00 00 


*- X4,/2 dA - 1. 


(77) 
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This fixes 



If we let X/2 = h 2 , we have 

/(A) = ^ e " WA ’* (79) 
which is equivalent to (67). 

The arithmetical mean has another important property. Let U 9 
form the so-called “residuals" by subtracting z from each measured 
value. Thus we have the set of quantities 

r, = z, - z. 

Now note that if we formed similar quantities for any other value, say 
z, we should have 

n n 

2> - z) 2 + n{z - z) 3 . (80) 

t - l »-i 

When z = z, the sum of the squares of the residuals is least. This is 
the basis of the so-called method of least squares. 

The significance of the parameter h 2 in the normal error law 
becomes greater when we inquire as to the average error in a set of 
measurements of a single quantity. There are various ways of defin- 
ing such an average, just as we found many ways of defining average 
deviation in the earlier sections of this chapter. Perhaps the most 
valuable average is the mean square average error. This is defined as 



The evaluation of the integral (cf. eq. 32) gives 


A 2 



(81) 


The root-mean-square average error is then 



(82) 


This serves to reinforce the meaning of h as a measure of precision, 
since the larger h is, the smaller is the root-mean-square error in a series 
of measurements of a quantity. 

The question now arises: Can we give an estimate of the error 
involved in the arithmetical mean itself? We can interpret (69) as the 
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probability that any particular value z shall be the “correct” value 
from the set of measurements zi • • • z„. This probability can be 
written 


POO - 



e -h>2 <*,-*)* 


(83) 


But from (80), this can be put into the form 
P(z) 


(84) 


We can include e with the multiplicative constant and write 

P(z) = Ce~ nhK, -' l) \ (85) 

C is evaluated in the usual fashion, i.e , 


~+<*> 

•S—ao 


-nhH.-i)*, _ 


dz « 1, 


whence 




( 86 ) 


(87) 


The probability associated with the arithmetical mean is therefore 
from (85) . 

POO ( 88 ) 


This means that the probability that the arithmetical mean shall 
represent the “correct” value for a quantity grows with the square 
root of the number of observations of the quantity. We can go 
further and express the me'an-square error associated with the arith- 
metical mean z in the form 
- +e0 

J n (s - zfp{z)dz = ~ (89) 

The root-mean-square error associated with the arithmetical mean is 
thus 

73 ' 7s m 

Since we have already seen that the root-mean-square error associated 
with any one of the measured values is ~~ry i (eq. 82) it follows that the 
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arithmetical mean is \/» times as accurate as any one of the measured 
values. 

The statistical theory of errors is now a very large subject to 
which we cannot begin to do justice in the above brief survey which 
has confined itself exclusively to the normal law and has disregarded 
other types of error distributions found in practice. A good review 
of the whole field will be found in the article by W. E. Detning and 
R. T. Birge, "On the Statistical Theory of Errors,” in Reviews of 
Modem Physics, 6, 119 (1934). 

PROBLEMS 

1. Compare Stirling's formula for the integers 1 to 10 inclusive with the exact 
values of n I. Compute the absolute and percentage errors involved in the use of the 
formula. Do the same for log n I and in addition find the percentage error (for the 
range of n a bove specified) involved in using simply log n! => n log n-n, i.e., neg- 
lecting log y/irn. 

2. In connection with the higher order average deviations from the mean in a 
distribution of n objects, prove that when k is even 

v'j? - [i.3 s ••• (*- 1)]*'* 

It 

3. Use Stirling's formula directly to transform Newton’s formula to Laplace’s 
formula, i e., eq. (26). 

4. In a coin-tossing experiment 10 coins (U. S. one cent pieces) were tossed 
1,100 times and the distribution of heads and tails noted after each toss. The results 
of this and another similar series of tosses are presented in the following table. For 
the interpretation of the table it may be remarked, for example, that there were in 
the first series 198 tosses giving 6 heads and 4 tails and 207 such tosses in the second 
series. Compute the expected distribution from the algebraic formula (2'). Then 


Heads 

Tails 

Series 1 

Series 2 

10 

0 

a 

0 

9 

1 

8 

11 

8 

2 

53 

45 

7 

3 

125 

134 

6 

4 

198 

207 

5 

5 

291 

274 

4 

6 

237 

233 

3 

7 

123 

136 

2 

8 

53 

51 

1 

9 

11 

7 

0 

10 

1 

2 


Total 

1,100 

1,100 
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calculate the average deviation from the expected mean (without regard to sign) as 
well as the standard and probable deviations. Compare these with the actually 
observed deviations in the two series of experiments. Comment on the results. 

Noting that in the first series the total number of heads was 5,448 and the total 
tails 5,552, whereas the expected mean would be 5,500 of each, calculate the prob- 
ability (eq 36) of a deviation as large as that observed. Do the same for the second 
series and then for the results of both series taken together. Draw from this what- 
ever conclusions you deem reasonable and plausible 

5. In Problem 4, plot (1) the expected distribution from the algebraic formula 


for 



(2) the expected distribution from Laplace’s approximation (26), and 


(3) the actual distribution Do this for both series. 

6. Carry out the analysis leading to the generalized Laplace'B formula (eq 51). 

7. Apply Smoluchowski's expression for <r^/~N i to the case of carbon dioxide 
at 31.89“ C (cf. Int. Crit Tables for data) Do the same for 100° C. Compare the 
result in each case with that obtained from the simple formula (54) 

8. Use Rajewsky’s observed results on the shot effect to obtain the root-mean- 
square current fluctuation in his experiment Could this scheme be used as a method 
of determining the charge on the electron? 

9. Plot the radioactive emission data of Sec. 11 as well as the corresponding 
Laplacian and Poisson distribution formulas and compare the experimental and 
theoretical distribution in each case. 

10. The median of a set of measured values is that value such that there are as 
many greater than it as there are less than it Find the law of errors corresponding 
to the assumption that the median is the most probable value of the measured 
quantity. 



CHAPTER III 


REVIEW OF THERMODYNAMICS 

1. FUNDAMENTAL CONCEPTS 

In Chapter I we stressed the difficulties associated with the classical 
dynamical method of describing physical systems containing a large 
number of constituent particles. The alternative, the statistical point 
of view, is the one which will be followed out in detail in this book. It is 
well to recall, however, that there is a well-known dynamical method 
of describing the behavior of physical systems, particularly with rela- 
tion to heat. This is thermodynamics. It has been remarkably suc- 
cessful in correlating a vast amount of empirical data concerning the 
thermal changes of bodies. Its program has been accomplished without 
postulating a molecular constitution for physical objects; hence it 
has avoided the above-mentioned obstacles in the path of the precise 
application of dynamics to systems of many degrees of freedom. Since 
the object of much statistical reasoning in physics is to provide a basic 
theory in terms of which the facts of thermodynamics find a rational 
explanation, it is desirable at this place to review the fundamental 
concepts of this subject. 

Thermodynamics is a discipline which endeavors to describe the 
behavior of large scale bodies, particularly with reference to their ther- 
mal changes, by the use of dynamical concepts. As has already been 
emphasized, however, the method of attack is quite different from that 
of classical mechanics. Instead of visualizing a body as a system 
composed of a large number of material particles, whose motion is 
sufficient to account for its behavior, we consider the body as a whole, 
i.e., macroscopically. In particular its state is no longer defined in 
terms of component particles but rather in terms of large scale quan- 
tities, operationally defined. These are volume, pressure, and temper- 
ature, which are termed the fundamental state variables. Only two of 
these are independent, since for every body there exists a so-called 
“equation of state” (cf. Sec. 3) connecting them. Thermodynamics 
does not pretend to relate pressure, volume, and temperature for all 
conditions of bodies, but only those in which the system if left to itself 
will remain unchanged, i.e., what we shall call states of equilibrium. 
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Whenever the same state of equilibrium is reproduced the state vari- 
ables return to their previous values. In other words, they depend on 
the state alone and not on how the system got into the state. 

All thermodynamic changes of state are called processes. They may 
be reversible or irreversible. A process is reversible when by an 
infinitesimally small change in the parameters controlling the state, 
the system may be made to pass in either direction through a sequence 
of states, without any net change in the surrounding environment. All 
other processes are irreversible (cf. Sec. 3, Chapter I). A process that 
carries a system through a sequence of states back to the initial state is 
called a cyclic process. The Carnot cycle is a familiar illustration. 

Thermodynamics employs other concepts besides state variables. 
Thus in a cyclic process, a certain quantity of heat may be absorbed 
and a certain quantity of work done by the system. After the system 
has come back to its original state these quantities do not return to 
their original value; in fact there is no meaning to this statement. 
Quantity of heat absorbed or given up by a system is not a state vari- 
able; neither is quantity of work done by or on the system. Both 
these quantities depend vitally on how the system goes from its 
initial to its final state. It is clear that for a thermodynamical variable 
to be a state variable it must be expressible in terms of the funda- 
mental variables of state in such a way that any small change in it is a 
perfect differential of the corresponding changes in the state variables. 

2. THE TWO LAWS OF THERMODYNAMICS 

With these preliminaries out of the way we shall now recall that the 
theoretical basis of thermodynamics consists of two principles, called 
the first and second laws. The first law of thermodynamics comprises 
the assumptions that heat is a form of energy and that in any thermo- 
dynamic process there is conservation of energy. This can be written 
in symbolic form by the introduction of a new thermodynamic state 
variable, the total internal energy of the system, which we shall denote 
by E. The principle says that if a quantity of heat A Q is added to a 
system and a quantity of work A W is done by it, the associated change 
in the internal energy A E is given by 

AE = A(? - A W. (1) 

We have already emphasized that Q and W are not variables of state 
or state functions. However, it is part of the content of the first law 
that £ is a state variable. In other words, if a system is allowed to 
undergo a series of thermodynamic cyclic processes from a definite 
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initial state back to this same state, although A Q and AW may be 
quite different for the different processes, experiment indicates that 
A Q — A W is zero for all the processes, thus suggesting that AE is 
always zero for a cyclic process and therefore that E is a state variable. 
This experimental suggestion is erected into a definite postulate. We 
therefore use dE to denote a change in E, implying that it is a perfect 
differential of the fundamental state variables, while we shall continue 
to use A Q and AW to bring out that these are not perfect differentials. 
It is scarcely necessary to emphasize that in (1) all quantities are 
expressed in energy units by the use of the mechanical equivalent of 
heat, approximately 4.2 joules per calorie. 

The first law is too general to serve by itself as the single basis of 
thermodynamics. It is necessary to supplement it by another principle 
whose purpose is to express the direction in which thermodynamic 
processes take place. We have already commented in Chapter I on the 
irreversible nature of many natural processes, particularly those in 
which heat transformations are concerned. The principle (1), how- 
ever, will apply just as well to reversible as irreversible processes. We 
need therefore a state variable which changes only in one direction, i.e., 
either never increases or never decreases. Such a quantity is found in 
the entropy which is defined by its differential, i.e., 


dS - AQ/T. 


( 2 ) 


In this definition it is understood that A Q is the change in heat energy 
in a reversible process. The function 5 defined in this way is a state 
variable, though Q is not. This can be shown by the generalization 1 
to the case of any reversible cycle of the fact that if in any Carnot 
cycle heat AQi is taken in at temperature T i and heat AQ 2 is given out 
at temperature T 2 , we have 


A(?i _ AQ 2 
Ti T 2 


(3) 


In order to state the second law of thermodynamics we need one 
more concept, that of a closed system. This means a system which has 
no interaction with its environment, i.e., it cannot gain energy from 
nor lose energy to its surroundings. With this and the foregoing in 
mind we can state the second law in the form: the entropy of a closed 
system never decreases. It may not change (as in a reversible cycle) 
but if it does change, it must increase, and this is always true for 

‘See, for example, Leigh Page, “Introduction to Theoretical Physics,” (second 
edition), p. 289. D. Van Nostrand Co , New York, 1935. 
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irreversible processes. In the latter case it is not possible to calculate 
the change in entropy from (2) : it is necessary to replace the irreversible 
process by a reversible one which has the same initial and final states. 
Then (2) can be applied. The reader will recall several conventional 
ways of stating the second law, e.g., that it is impossible for a self- 
acting engine continuously to convey heat from a body of lower temper- 
ature to one of higher temperature. All such statements may be 
shown to be logically equivalent to the one we have given above. 

The entropy may be used to give an alternative formulation of the 
first law (1). Thus 

dE = TdS — AW. (4) 

As we have already emphasized, the work AW depends on the nature 
of the process. However, it may be thought of as owing to the change 
in certain parameters £i £„ which express the dependence of the 
system on its external surroundings. If the change in ( } is associated 
with a generalized force F } , the work done in the change is F } d£ } and 
we can write (4) in the form 

n 

dE = TdS ~^F } d^. (5) 

i-i 

It must be pointed out that eq. (5) does not refer to a closed system 
since it contemplates interaction of the system and its environment. 
Hence in (5) dS need not be zero or positive as is required by the 
second law for a closed system. 

It cannot be too strongly emphasized that the changes symbolized 
in (5) are those that take place between equilibrium states. Neverthe- 
less since E and S are state variables, eq. (S) will apply even when the 
change from one equilibrium state to another takes place irreversibly, 
i e., through a senes of non-equilibrium states. Can we talk about the 
entropy of a system when it is not in an equilibrium state? Certainly 
we cannot compute it by eq. (2), since that applies only to a reversible 
process and a closed system cannot reach a non-equilibrium state by a 
reversible process. It can, however, reach an equilibrium state from a 
non-equilibrium state by means of an irreversible process in which the 
entropy will increase. Hence we can say that the entropy of a closed 
system in a non-equilibrium state is less than its value in the equi- 
librium state toward which it proceeds. This is the basis of the state- 
ment of the second law in the form: the entropy of a closed system 
tends to a maximum value or the entropy in an equilibrium state is a 
maximum with respect to its value for all non-equilibrium states from 
which the given equilibrium state can be reached by irreversible 
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processes. Naturally the closed system, if in an equilibrium state, 
will not proceed by an irreversible process to a non-equilibrium state ; 
this is inherent in the definitions of equilibrium state and irreversible 
process. 

3. FREE ENERGY AND OTHER THERMODYNAMIC FUNCTIONS 

From the entropy, internal energy, volume, pressure, and tempera- 
ture other state variables can be formed. The most important of 
these are the following : 

(а) Helmholtz free energy (or Gibbs’ 4' function) 

= E - TS. (6) 

In future applications, when we speak of “free energy” it will be this 
function which is meant. 

(б) Enthalpy (or Gibbs’ X function) 

X = E + pV. (7) 

(c) Gibbs free energy (or Gibbs’ Z function) 

Z = E + pV - TS - X - TS « ¥ + pV. (8) 

For the statistical interpretation of thermodynamics the Helmholtz 
free energy is the most important, though for most applications of 
thermodynamics, the Z function is more significant. As eq. (8) 
indicates, Z and 'k are closely related. 

From (6) we have 

d$i — dE — TdS — SdT. (9) 

But (5) can be used to transform this to 

d* = - SdT - di y (10) 

3 

For an isothermal process (dT = 0) eq (10) says that there is a 
decrease in free energy equal to the external work done by the system. 
If the only way in which the system can do work on its surroundings 
is to change its volume against external pressure p and if this process 
is isothermal, we have 2 F } d£, = pdV and hence (10) becomes 

d<Sf =- pdV, (11) 

or 



( 12 ) 
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This important relation is called the equation of state of the system. 
If ¥ can be found as a function of V and T, it enables us to connect 
p, V, and T for the system. As an illustration, we shall later see that 
a statistical analysis yields for the free energy of an ideal gas of N 
particles occupying volume V at temperature T 

¥ = — NkT log 7 + K, <(13) 

where k is the so-called Boltzmann gas constant ( ,k = 1.37 X 10 -1# 
erg/degree C) and K is an arbitrary additive constant independent of 
volume but not necessarily independent of temperature. The com- 
bination of (12) and (13) gives 

pV — NkT (14) 

as the equation of state for an ideal gas. 

4. SOME THERMODYNAMIC RELATIONS 

Important thermodynamic relations can be deduced from the fact 
that dE, d'P, dX. and dZ are perfect differentials of the state variables 
Thus from (10) it follows that 



From the first of the expressions in (15), the free energy itself can be 
written in the form 

*- E + r (%\ (16) 

This is sometimes called the Gibbs-Helmholtz equation. By differen- 
tiating S in (15) partially with respect to and F, with respect to T 
we obtain 



When the reduce to a single parameter, namely V, and the corre- 
sponding Fj becomes p, this is known as one of Maxwell’s thermo- 
dynamic relations * Three other relations can be derived by express- 
ing in similar fashion the fact that dE, tfX and dZ are perfect differ- 
entials. They are set down here for convenient reference. The first 
is (written in general form) 



1 Cf op. cu , p 296. 



Sec. 5] 


r£sum£ 


47 


The other two are usually written only for the case in which Fj reduces 
to p and ^ becomes V. Then they are 

(~) =(~) 

\dp/s \dS/p 

(19) 

(-) =-(-)■ 

\dp/T \dT/p 

(20) 


The heat capacity is an important concept which we shall meet 
later. By definition the heat capacity of a substance at constant 
volume is the heat absorbed per degree change in temperature at con- 
stant volume, or 

The last step came from the first law (5) with = pdV. We can 

also define a heat capacity at constant pressure, thus: 

^-(iX-(ffl + ( s "'iX (22 > 

If the parameters reduce to the volume alone and F, becomes the 
pressure p, (22) may be written 

M£X + '(!fX 

The general formula 

(fX-dWi-a 

becomes the well-known formula 

C p - C r = Nk 

for the special case of an ideal gas. When applied to one gram of sub- 
stance, the heat capacity becomes the usual specific heat, denoted by 
cy and Cp. 

5. r£sum£ 

With the basic principles discussed in the preceding sections of this 
chapter it has proved possible to give a coherent description of a large 
number of physical and chemical phenomena. This is not a textbook 
of thermodynamics and hence we shall not pursue the purely thermo- 


(23) 

(24) 

(25) 
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dynamic method of description in a detailed fashion. 1 Rather we are 
now interested in seeing how the statistical point of view provides an 
interpretation of thermodynamics in terms of the atomic constitution 
of matter. Several different ways of doing this have been devised. 
Those which we shall consider in this book are: (a) the classical sta- 
tistics of Maxwell and Boltzmann; (6) the classical kinetic theory; 
(e) the statistical mechanics of Gibbs ; (d) the statistical mechanics of 
Darwin and Fowler; ( e ) the quantum statistics. In each case the 
statistical theory strives to set up a number of statistical quantities 
analogous to the state variables of thermodynamics and beginning 
with very general postulates to derive a set of relations among them 
which can be interpreted as physically equivalent to the thermo- 
dynamic relations we have just discussed The hope exists, further- 
more, that the statistical point of view will provide an even -deeper 
understanding of physical phenomena than thermodynamics by sug- 
gesting laws which are not susceptible of thermodynamic derivation. 

Our method of procedure will be, to a certain extent, an historical 
one ; we shall examine the older physical statistical methods first. This 
is the natural order, for the more recent quantum statistics employs 
the same fundamental ideas as the earlier statistics and it will be desir- 
able to have the latter firmly in mind before proceeding to the former. 
It is hoped that in this way the reader will get a clearer view of the 
whole subject than if we tried to adopt a unified point of view and 
abandoned the historical approach entirely. We shall indeed find that 
the different methods of presenting classical statistics lead essen- 
tially to the same result when applied to the same problem. Some 
may therefore take the stand that a discussion of all methods is super- 
fluous. On the other hand the greater the number of ways in which we 
can look at a problem the more profound and thorough should be our 
understanding of it. 

PROBLEMS 

1. In the schematic diagram A and B represent equal volumes A is occupied 

1 1 by a mass m of an ideal gas at pressure p and temperature T, while B 

A B is a perfect vacuum. Calculate the change in entropy which results 
when a hole is opened in the partition between A and B, allowing the 
gas to move freely from A to B 

'At this point the reader may wish to consult any one of a number of standard, 
more or less elaborate, treatments of thermodynamics, e g., “Textbook of Thermo- 
dynamics," by P Epstein, John Wiley and Sons, New York, 1937; “ Thermody- 
namics, ”byE Fermi, Prentice-Hall, New York, 1937, "Heat and Thermodynamics," 
by M. W. Zcmansky, McGraw-Hill, New York, 1937. Of particular interest to 
physicists in view of the discussion in the later chapters of the present volume is 
P. W. Bridgman's "Thermodynamics of Electrical Phenomena in Metals,” Mac- 
millan, New York, 1935. 
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2. Compute the change in entropy involved in mixing 1,000 grams of water at 
80° C with 500 grams of water at 15“ C. Assume that the specific heat of water is 
constant and equal to 1 cal/ gram degree C. 

3. Two grams of hydrogen have an initial volume corresponding to a pressure of 
76 cm of Hg and temperature 20° C. The volume being kept constant, the gas is 
heated to temperature 80° C. It is then allowed to expand at constant tempera- 
ture to double its original volume. How much heat has been absorbed by the gas 
and how much work has been done on it? Determine the same quantities when the 
gas is first allowed to expand to double its volume, the temperature being kept con- 
stant at 20° C, and is then heated at constant volume from 20° to 80° C. In each 
case also calculate the total change in the internal energy of the gas. 

4 . From eq. (17) of this chapter derive the Clausius-Clapeyron equation, viz., 

where L *= latent heat of vaporization and (AF,)r is the change in specific volume 
between liquid and vapor phases at constant temperature T. 

If the specific volume of water at 100° C is 1 cm 8 /gram, while that of steam is 
1,686 cm*/gram find the change in the boiling point of water produced by lowering 
the pressure by 10 cm of Hg in the neighborhood of 76 cm of Hg 

If the difference in specific volume between the liquid and solid phases of water 
at 0° C is 0 1 cmVgram, find the depression of the freezing point of water associated 
with an increase in pressure of one atmosphere 

5. It is shown in Chapter IV (eq 84) that the free energy of an ideal gas with 
N particles in volume V at temperature T is 

Use eq (16) of this Chapter to find the expression for the total energy of the gas. 

Given one mole of an ideal gas under standard conditions of temperature and 
pressure. Find the change in free energy if the volume is doubled at constant tem- 
perature Find the change in free energy if the temperature is raised by 1° C while 
the volume is kept constant 

6. Derive the expression for the entropy of an ideal gas from the application of 
Problem 5 to eq (15) of this chapter 

7. By applying eq (25) of this chapter to the special case of hydrogen, show how 
the mechanical equivalent of heat may be calculated 
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1. STATISTICAL DISTRIBUTION OF N OBJECTS IN jt GROUPS 


The fundamental problem of what has come to be called the 
Maxwell-Boltzmann statistics is the following: Given a large number 
of objects, N, e.g., molecules of a gas, it is desired to distribute 
these with respect to some property they all possess, e.g., position in 
space, velocity or kinetic energy. It will be convenient to think of 
this property as associated with a set of n boxes, with a definite value 
of the property attached to each box. We shall first assume that the 
objects are indistinguishable, that they move freely and exert no 
forces on each other, and that it is just as likely that a particular 
object shall lie in one box as in any other. We can then readily com- 
pute the probability of an arrangement in which there are N x objects 
in the first box, N 2 in the second, N s in the third, • • ■ , and N„ in the 
pth, by finding the number of independent ways in which this distri- 
bution can be achieved. From eq. (2') of Chapter II the number of 
ways of choosing JVj of the N objects to place in the first box is simply 

( 


N\ = N\ 

Ni) Ni'(N - Ni)\ 


Similarly the number of ways of choosing N 2 objects out of the remain- 
ing ( N — N x ) to place in the second box is 


(N - N,\ = (. N - N x ) ! 

\ Ni ) n 2 \(N - N x - N 2 )\ 


Clearly then the total number of ways required is the product 



- Ni\ fN-Ni-Na 

N* / A A. 


which can be reduced to the following form 

\ N ] = N\ 

UJ NilNal - ■ N„l' 


( 2 ) 

( 3 ) 


For simplicity we shall use the square bracket to denote this expression. 
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The probability associated with the above distribution will be J 

divided by the total number of ways of distributing the N objects 
among the n boxes without regard for the number of objects in each 
box. By a simple generalization of the argument in Sec. 1 of 
Chapter II, the latter number is m^- Hence the probability desired 
is simply 



( 4 ) 


Equation (4) can be immediately generalized to the case where the 
a priori probabilities of each box are not equal. Suppose that they 
are actually given by gi, g a , • • ■ g M , respectively, where, since we here 
assume that these are actual mathematical probabilities, we have 


■ !• 


(S) 


i - 1 


Then the probability that N x objects are in the first box, N a in the 
second, etc., if there were only one way in which to realize this distri- 
bution, would be 


gl gl 


i!!'. 


Since however the number of ways of making this distribution is 


m 

II, the total probability becomes 

rf - 


( 6 ) 


This reduces to (4) when gi = ga =•••=£„ = 1 /m- 

We now ask the following question: Given an aggregate of N 
objects whose number remains constant, for what type of distribution 
among m boxes will P„ have the maximum value? Intuition answers 
the question by saying that the required distribution is that for which 
the number in the jth box is 

N, = Ng,. (7) 

Let us see whether we can confirm this conjecture. The problem is 
to make P* a maximum subject to the condition 



= N = constant. 


( 8 ) 


Choose any set of N, satisfying the condition (8). If we can find the 
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conditions which gi • • • g, • • • g? must satisfy in order to make 
a maximum subject to (5), we shall have solved the problem. We 
write (6) in the form 


logP? = log 



+ SiV, log g,, 


( 9 ) 


and find it simpler to make log a maximum subject to the con- 
dition (5). But log P% is stationary if any slight but arbitrary 
variation in the g, leads to zero variation in log P*. The meaning 
of log P% assures that in the present case the stationary value will be 
a maximum. Hence the mathematical formulation of the problem is 


subject to 


S log P” = 0, (10) 

£ «& = 0 , ( 11 ) 


where we use the symbol b to denote an arbitrary variation, 
then have 




We 

( 12 ) 


subject to (11). Let us solve (11) for bgi in terms of the rest 



Substitute this into (12) and obtain 



(13) 


(14) 


In this expression Sg 2 , • • - bg^ are completely arbitrary, since by giving 
them any consistent values (proper fractions, of course) we can still 
choose 6gi to satisfy the condition (11). Hence the only way to satisfy 
(14) is to have identically 


N i 

_ A* 

ii 

Si 

g2 

& 


N. (15) 


The fact that the constant ratio is N is, of course, a result of (5) and 
(8). This checks the intuitive deduction. 

We have indeed carried out the deduction (15) somewhat indi- 
rectly. In actuality the g, are fixed quantities and the distribution 
sought is the set of Nj for which P„ is a maximum subject to (8). 
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Suppose, however, that we have the correct set of N } . If we substi- 
tute them into (9), to secure the required maximum we must still 
have (15) satisfied. As a matter of fact we could start with (9) and 
vary the N } subject to (8) keeping the g } fixed. This involves using 

Stirling’s formula for the factorials in j^J. It will yield the result 

(15), as the reader can show, but the method we have used is simpler. 

The reader should see the connection between the distribution 
given by eq. (15) and the special one treated in Sec. 9 of Chapter II 
and expressed in eq. (53) of that chapter. 

2. THE CANONICAL DISTRIBUTION 

We now wish to generalize the distribution problem of Sec. 1 by 
assigning to each box a certain property which will be possessed by any 
object in that box To fix the ideas dearly, let us imagine that the 
objects are material partides and the property is kinetic energy. We 
again assume that the particles are free. The energy E, will be 
assigned to the jth box: whenever a particle is in that box it possesses 
precisely this energy value. The total energy of the aggregate of 
particles is thus E — 'LNjE,. The problem is now to distribute the 
particles among the boxes in such a way that the distribution proba- 
bility P„ will be a maximum subject not only to the condition of 
Sec 1, viz , that the total number of particles remains constant, but 
also to the additional condition that the total energy of the aggregate 
remains unchanged. It will be assumed that the boxes have the 
a prion probabilities gj • • g M as in Sec 1. 

As before we shall work with log P%. From (9) using (3), we have 

log P" = log N ! — 2 log N, ! + S N, log &. (16) 

The analytical formulation of the problem is 

3 log P% = 0, (17) 

subject to 

3SJV, = 0; SZN& = 0. (18) 

In all sums, unless otherwise specified, it will be assumed that j runs 
from 1 to ii. 

From Stirling’s formula (eq. 25 of Chapter II) 

log Nl = N log iV — iV + ^ log 2ir + ^ log N. (19) 

If iV is suffidently large the terms log 2t and log N are negli- 
gible compared with the first two terms on the right. We shall con- 
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siatently neglect them for the sake of simplicity. This might seem to 
be an invalid procedure in log Nj ! when Nj is small, as it can quite 
possibly be. However the error involved will still be small even here, 
because the terms log N, ! for small Nj make only a very small con- 
tribution to the whole sum. Condition (17) now becomes 



and conditions (18) 

28N } - 0; TiEjSNj = 0. (21) 

Note that the E, are definitely fixed and the only possible variation 
in energy comes from the &N } . The maximization problem is now car- 
ried out by the use of Lagrange’s undetermined multipliers. 1 We 
choose the initially arbitrary constant multipliers yj and y a and, 
remembering (20) and (21), write 

S (log ^ + 7i + **) 8N, - 0. (22) 

We now pick yi and y 2 to satisfy the equations 


log + 7i + 72 -Ei = 0, 
log 7T + 7i + 72^2 = 0. 


(23) 


This we have a right to do, since 71 and 73 are arbitrary. Equation 
(22) now becomes 



log jj- + 7i + 72£> 


) 


& N, 


0 , 


(24) 


and the variations 8N 2 , • ■ • , 8N„ occurring here are now completely 
arbitrary. For all j, therefore, we have 


l°g^= — 7l — 72-E/, 

or 

Nj = & e T,+T **'. (25) 

1 Cf., Leigh Page, "Introduction to Theoretical Physics, ” second edition, p. 311. 
D. Van Nostrand Co., New York, 1935. 
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It is now convenient to replace 71 and by two new parameters 
and 6 defined by the equations 


71 - Q + log Np, 


Ya - 


1 

e' 


(26) 


The distribution formula (25) then takes the form 

N, = pg,Ne+ /6 e- s > /9 . (27) 


The parameters and 0 can at once be determined, at least in prin- 
ciple, as follows. We have from ( 8 ) 

2 N, « = N, 

whence 

*=- 01 og 2pg,e-*> /B , (28) 


which gives ^ in terms of 0. To get the latter consider the total energy 


E - Nc* /9 ■?,pg 3 E ] e- B ' /(i 


= N 


2pg,E,e-*’'* 

ZWT** ' 


(29) 


In this way we see that 4 1 and 0 are expressible in terms of E, N, gj 
and Ej. The actual evaluation of and 0 from these transcendental 
equations cannot, of course, be carried through in closed form. We 
shall hope, however, in the subsequent discussion to give them a 
physical interpretation. Incidentally (29) also provides an expression 
for the average energy per particle in the distribution, viz., 


- E _ 'L b E i e- B > /% 
* = N~ 2 g) e- E > /B 


(30) 


The distribution defined by (27) is usually termed a canonical 
distribution. Since 

*-*'* = 2 Hg,e- S > / *, (31) 


an alternative way of expressing this type of distribution is clearly 


N, = 


N gj e- s ’ /e 


(27') 


This eliminates the parameter \J/ from the distribution formula. The 
parameter 0 is termed the distribution modulus. It is clear that it 
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must have the dimensions of energy and so must p. We have said 
nothing so far about the sign of 0 which might be considered arbi- 
trary. However, from (27') it is clear that negative 0 would lead to 
indefinitely large values of N, for increasingly large E,. As a matter 
of faht, investigation shows that 0 must be positive if condition (17) 
is to correspond to a maximum rather than a minimum of logP*. 
By disregarding g„ positive © makes the number of particles in the 
jth box decrease exponentially with the energy assigned to that box. 
Of course the distribution is affected by the choice of a priori proba- 
bilities gj. 

Our understanding of the canonical distribution law (27') will be 
enhanced by the consideration of a few special illustrations. First 
suppose that g, = l//i for all j and E, = E/N for all j. Inspection of 
(27') for this case shows that for all j 



corresponding to a uniform distribution. If we forego the restriction 
on g ]t save for Zg, = 1, the resulting distribution becomes 

N, = Ng „ 


i.e., the same as that already studied in Sec. 1. 

For the second illustration imagine that E, is an integral multiple 
of a certain fundamental energy unit 6, i.e., E, = (j — 1)8 where j, 
as usual, takes the values 1,2,3, .... Further assume that p is so 
large that we can effectively consider the sums over j from 1 to p as 
infinite series. For simplicity we shall suppose that the gj are all 
equal. Therefore since e -e/e < 1 


2 e -E,/6 _ 2 e -0- I)e/e _ 


1 

(1 - e~ B/ »y 


Moreover by a simple extension of this 
2(j - 1 )E-e~ <J ~ 1)B/9 = 


Se -E/e 

(1 - e~ E/6 ) 2 ' 


In the sums j runs from 1 to » . 
average energy 


Consequently (30) yields for the 


E = 


fTWZTi 


(32) 


We shall later have occasion to appreciate the larger significance of 
(32) (cf. Sec. 10, Chapter VIII), but for the present let us use it as a 
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means of getting a somewhat clearer light on 0. 
yields 


0 = 



Solving (32) for 0 
(33) 


Suppose that S < < E, which is not too far-fetched an assumption. 
Then we have the interesting approximation 

© ~ E, (33') 

or the modulus © is approximately equal to the average energy per 
particle. We shall investigate later a generalization of (33'). From 
(28) we now have for the parameter p 

S . 1 

* log (1 + 8/E) g 1 - <r (i +«/» ’ 
which yields approximately 

E 

p ~ — £ log — • (34) 

O 


If we choose to set = Np, we can further write this 




—E log — 


in terms of the total energy. The distribution formula for this special 
case is 

N, = jVe -<,-I5e/8> (1 - «T e/8 ), (35) 

or approximately 

N 3 ~ (1 - e~ Ne/E ). (36) 

An illustration of more practical importance than the above is pro- 
vided by a collection of linear simple harmonic oscillators, all with 
frequency v. It is shown by quantum mechanical reasoning 2 (also 
cf . Chapter VIII) that the possible energy values of a linear harmonic 
oscillator are given by 

£, = (;- i)fo, j- 1.2,3, ••• (37) 

where h is Planck’s constant of action (6.55 X I0“ 27 erg sec). The 
a priori probabilities are all equal. Assuming again that the number 
of states is very large, we have 

p -h,/2» 

2 e ~ E > /9 = 75. 

* Cf. Lindsay and Margenau, “Foundations of Physics,” p. 430. 


( 38 ) 
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Consequently the distribution function takes the form 
N, = Ne- (J - I)h,/B -(1 - e~ k ' /B ) 


(39) 


which is identical with (35) with S = hv. However, a somewhat dif- 
ferent and very interesting situation presents itself with regard to the 
average energy per oscillator. If we differentiate (38) with respect to 
8 the result is 

2£,e"* ,/e = y • e~ k ’'* B • (40) 

and from (30) this gives 


-_hv (1 + e~ k ’ /B ) hv hv 

2 (1 — e~ h,/B ) 2 e k,/B — 1 


(41) 


The significant feature of this result is that, unlike (32), the average 
energy no longer vanishes when 0 becomes zero. In fact 


(-E)e-o = y = Ei. 


(42) 


We shall later learn to call this a “ zero-point ” energy. It is clear 
that this situation will always arise when the lowest possible energy 
value is different from zero as it is in the present case. 

In the collection of simple harmonic oscillators considered above 
the a priori probabilities are all equal. It turns out that if the collec- 
tion contains -two dimensional harmonic oscillators, this situation no 
longer prevails. Rather the g } increase linearly with j, so that we can 
write 

&=£. (i= 1,2,3...-) 

where C is a constant depending on the number of energy boxes. 
Division by this constant is necessary to make g, a probability in the 
sense in which we are using the term. Actually this constant plays no 
role so far as the average energy is concerned. Irrespective of the 
value of C, eq. (30) gives for the average energy 


- ZjE 3 e- E ’ /B 
h ~ 2 je~ E >' B ' 

Quantum mechanics yields for the two-dimensional oscillator 


(43) 


Consequently 


E, = jhv. 

- _ hvLj 2 e~ ,k ’ /B 
2 j e -> k ' /B 


( 44 ) 
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Now by differentiating Se~ ,h,ye = l/(e hw/e - 1) with respect to B, 
we secure 


.h'/e 

3 - _ 1)2 * 
A second differentiation yields 

e h,/e (e h,/B + 1 


Consequently 


,*».-*/• _ + 1) 
J “ (e*’ /e - l) 3 ' 


7h„ 


(45) 

(46) 

(47) 


This should be compared with (41) and the change in the “ zero- 
point ” energy noted. 


3. PROPERTIES OF THE CANONICAL DISTRIBUTION. INTERPRETA- 
TION OF THERMODYNAMIC RELATIONS 

It is now in order to study some properties of the canonical distri- 
bution and to indicate analogies with the important thermodynamic 
relations. 

Let us evaluate the expression for log P^ (eq. 16) in the case of a 
canonical distribution. We have 

log P* = N log N - N — 2 (iV, log ~ - n j)’ (48) 

Substitution of N, from (27) yields ultimately on reduction, if we 
denote the canonical value of P^ by P e , 

log n N P e = — - q — = log w, (49) 

where we have again replaced N>p by ’i r ', and where in what follows 
we shall always consider n N P e replaced by w. We shall refer to the 
latter as the “ statistical probability ” for a canonical distribution. 

We shall now imagine that the system of particles undergoes a 
change in its total energy, taking place in two ways, viz., (a) by a 
change dE } in the amount of energy associated with each box, and 
(6) by a change in the distribution over the boxes, the change in the 
number of particles in the jth box being denoted by dN } . The total 
change in energy then appears as 

dE = dXNjE, = HEjdN, + 2N } dE } . (50) 

There is another way of looking at thi9 which is rather expeditious 
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and moreover will prove very useful when we come to alternative 
presentations. 

Let us introduce the transformation 

£ = e" 1/e . (SI) 


We can then write from 


(29) 

E ~ z»«,r £ ' ' 


(52) 


We shall moreover find it convenient to denote 2 by a special 
symbol and use Z for this purpose. Thus 


z = ZM&r*’. 

Now 

dZ v 

f- = 

and therefore 

Z df 


(53) 

(54) 

(55) 


This proves to be a very useful mode of expression for the energy of 
the system. We may call Z the distribution function. Later we shall 
find something very like this called the partition function in the method 
of Darwin and Fowler (Chapter VII). We shall now consider the 
total change in E associated with a change in the modulus f (or @) 
and the changes in K the latter being lumped in the change in Z. 
Now we shall assume that the function Z depends on f as well as on 
certain external coordinates, namely the £i • 4 • {„ already mentioned in 
connection with the thermodynamical treatment in Sec 2, Chapter III. 
When we carry out the differentiation with this in mind we get for 
the total change in E 


dE 


-( 


N ™xl + N( s ‘ l °e z 




er 


«*> 


Next we proceed to consider the work done in the change in the 
external coordinates For a particle in the jth state in which the 


dE 

energy is E } our alteration in brings into play a force and 

the work done by this particle when all the parameters change by 
d(i, df g • • d( n is 
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Now the average number of particles in the jth state is N„ given by 
(27) or (27'). which by employing (53) can be written in the form 


N, = 


nNg,^> 


(58) 


Consequently the contribution to the work by all particles in the jth 

E dE, 

— — d£x and that by all systems in all states is clearly 




(59) 


From the fact that 


3£x V 
we can write 


x J 


dW = 


JV 


3 log Z 


logfV 


dh 


(60) 

(61) 


Therefore the change in the total energy of the system plus the external 
work done by the system is 


dE + dW 


= N 


slogZ +f^k 


<“> 

Let us now go back to the expression (49) for log w. From (28), 
(51) and (53) this may be written 

log w = — E log f + N log Z (63) 

The change in log w when 0 and the £x alter therefore is 


m 


. d log Z 


d log w — — dE log f ~ + N . - x > 3 £ x 


dt + N^ 


d log Z 


dfx- (64) 


From (55) it is clear that the two middle terms in (64) cancel each 
other, so that finally 

E d log Z , 

— Tt- — ^x- 
x °fx 
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If we divide through by — logf we get 


d logu> 
logf 



d log Z 


di\. 


But from (51) and (61) this at once becomes 

0 d log w = dE + dW. (65) 

This very interesting relation evidently bears a close analogy to the 
first law of thermodynamics (eqs. 1 and 4, Chapter III). We can 
indeed look upon it as the statistical equivalent of the first law if we 
are willing to identify the increment of quantity of heat transferred 
to the system of particles as Qd log w. Let us then write 

AO 

— = d log w. (66) 


Now the statistical quantity log to has the property that its value 
depends solely on the state of the system in its canonical distribution 
and does not depend on how the system got into that state. It is 
therefore competent to serve as a statistical analogue of a thermo- 
dynamical state variable. In fact we see from (66) that if we interpret 
the statistical canonical distribution modulus 0 as a universal con- 
stant k times the absolute temperature T, we can write * 

AO 

d[k log w] = — • (67) 


Comparison with eq. (2) in our discussion of thermodynamics in Chap- 
ter II suggests that we interpret the left side of (67) as the change in 
entropy of the system and write 

d[k log w] — dS. (68) 

The integration of (68) then yields for the statistical interpretation of 
entropy 

5 = k log w + C, (69) 


where C is an arbitrary additive constant of integration. We are 
entitled to choose |or C the constant quantity which will make (69) 
agree best with the known thermodynamical properties of the entropy. 

' h turns out to be the Boltzmann gas constant 'with the value 1 37 X 10 -u 
ergs/°C (Cf. Secs. 1 and 2, Chapter V). 
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With this in mind we find it advantageous to choose C = —k log N\ 
and write our statistical definition of entropy in the form 

S = k log (^j)' (69') 

We may indeed proceed to define w/Ni as the “ effective ” statistical 
probability for the canonical distribution. Examination of the pre- 
ceding sections of this chapter discloses that if we had used it in place 
of w no change would have resulted in the canonical distribution. In 
Chapter VIII we shall see a close connection between the definition 
(69') in classical statistics and the quantum mechanical definition of 
entropy. It is well to emphasize that the relations (68) and (69') are 
dependent on the fact that the system of particles is canonically dis- 
tributed. Since the equations of thermodynamics refer to systems in 
equilibrium it is therefore appropriate to assume that the canonical 
distribution is the statistical analogue of equilibrium. This corre- 
sponds well with the fact that log P c (and likewise log w) is a maximum 
for a canonical distribution as compared with any other distribution 
of the same system of particles with the same energy. The same is of 
course true for log (iv/N !). If there were a configuration of greater 
probability than the canonical distribution we should expect that the 
system would not rest in equilibrium until it had attained this more 
probable state. On this view a system with a fixed number of par- 
ticles and given energy not in an equilibrium state will correspond to 
a smaller value of log ( w/N ')• But such a system will tend to approach 
a state of greater probability and this is the statistical interpretation 
of the irreversible tendency of the entropy of a closed system to 
increase; the latter has already been emphasized in the thermo- 
dynamic definition of entropy in Sec. 2, Chapter III. There is, to be 
sure, a fundamental difference between the second law of thermo- 
dynamics and the law based on the assumption (69) or (69'). Accord- 
ing to the second law, the entropy of a closed system always increases 
in a non-cyclic, irreversible process. According to the statistical inter- 
pretation it is only probable that the entropy will increase. From the 
very nature of the statistical definition of entropy there is no necessity 
that the entropy must always increase under the conditions stated. 
There will always be indeed a finite probability that it will decrease 
This may be looked upon as the price we have to pay for the statistical 
interpretation. 

We can illustrate the above situation by attempting to treat the 
change in entropy during a given process, as statistically defined in 
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(68), as a measure of the irreversibility of the process. Suppose the 
system changes from state 1, where the statistical probability is w, 
to state 2 with probability w + Aw. It seems appropriate to take 
the measure of the irreversibility in going from 1 to 2 as the ratio 
( w + Aw)/w. We shall call this z. Then 

w 4- Aw 

log s = log — — — = A log w, 

whence 

z = e Aloi " = e AS/k . 

As an example, consider the process involved in the passage of 1 erg 
of heat energy from a body at 21 °C to another body at 20°C brought 
in contact with the first. If we replace the process by an equivalent 
reversible one, i.e., one carried out ideally with infinite slowness, we 
can calculate the entropy change AS by (67) and get on substitution 
of the data AS/k = 8 S X 10 10 . Hence in this case z > 10 lol °, indi- 
cating a high degree of irreversibility. It is rather interesting to con- 
sider what happens when the quantity of heat energy transferred is 
only 10~ u erg. Then z = e 086 , which is less than 3. Indeed as the 
amount of energy transferred becomes smaller and smaller, z —* unity 
and w for the two states approaches the same value. For very small 
energy transfers the process becomes less and less irreversible from the 
statistical standpoint. This casts a further interesting light on the 
significance of the statistical interpretation of entropy. 

It is now not a difficult matter to find the statistical analogues of 
the other important thermodynamic variables. Consider again the 
Helmholtz free energy (eq. 6 of Chapter III) 

¥ = E - rs. 

With the statistical interpretation of 5 this becomes 

¥ = E — kTXogw + kTlogNL (70) 

But now compare this with eq (49). Let us set as usual 

0 = kT. (71) 

Then from (28) and (49) and the use of Stirling’s formula 

-NkT (log | + l) (72) 

is the statistical expression for the free energy in terms of the parti- 
tion function as we have introduced it in cq. (53). This of course 
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assumes that the constant k, whose value is not specified by the 
statistical theory, is chosen correctly (cf. Secs. 1 and 2, Chapter V). 

We should be able to use the expression (72) to derive the equation 
of state of the system from eq. (12) of Chapter III. Thus the pressure 
should be given by the equation 


= NkT 


d log Z 
dV 


( 73 ) 


If therefore we were to derive the expression for the partition function 
Z for a system of free particles and in particular its dependence on 
the physical volume occupied by the system, eq (73) should reduce 
to the well-known equation of state of a perfect gas, viz., 

pV — RT, (74) 

where R is the so-called gas constant Now it is clear from the form 
of (73) that in order for it to yield (74) it is essential that 

Z = KV, (75) 

where K is independent of V, though it may contain the mass of the 
particles, the parameter ©, etc , and must have the dimensions of 
reciprocal volume. In the next section we attempt the calculation of Z. 


4. DISTRIBUTION OR PARTITION FUNCTION FOR A SYSTEM OF FREE 
PARTICLES. FREE ENERGY OF AN IDEAL GAS 

The problem is to obtain for an ideal gas the volume dependence 


of the function 




Z = 

(76) 

where 




E > ^ 2 m ^ ' 

(77) 


All particles have the same mass, m, and the p JX , p )V and p )t are the 
component momenta along the x, y, z axes respectively of a particle 
in the jth box. Since the gas is assumed to be ideal, the energy is 
kinetic only. Before we can use (77) to evaluate (76), we must intro- 
duce the appropriate a priori probabilities g,. From our previous 
discussion (cf. Chapter II, Sec. 9) it is reasonable to assume that gj is 
proportional to the size of the jth box. Clearly, however, size here 
does not refer merely to physical volume but also to momentum 
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interval as well. Let us suppose that a particle in the jth box has its 
configuration coordinates in the volume element A*, A y, A about the 
rectangular coordinates x 3 , y„ z 3 , and its momentum components in 
the momentum interval Ap )X Ap )y Ap„ in the neighborhood of the 
momentum components p )x , p )y , p Jt . It is then natural to suppose 
that g, will be equal to Ax } Ay 3 Az } Ap }X Ap }V Ap ]t divided by the total 
“ volume ” of the six-dimensional space defined by the coordinates 
x i> V» s :< Pix< Piv< Pi* and whose total extent is given by the range of 
variation allowed to these coordinates. We have now to decide whether 
the coordinates shall be allowed to vary continuously or in discrete 
amounts. We shall make the assumption, which will prove of value 
in the later discussion of quantum statistics, that the six-dimensional 
space under consideration has a cellular structure in which the cell 
has volume h 3 , where h is a fundamental constant having the dimen- 
sions of momentum times displacement. This assumption means that 
no matter how much x„ y„ z 3 , p )Xl p JU , p Jt vary among themselves 


Ax, Ay } Az 3 Ap }X Ap JV Apj , > h 3 . 


(78) 


We shall now assume further that there are y cells available. Hence 


finally 


= Ax 3 by 3 Az 3 Ap n Ap 3V Ap 3Z 
gl M 3 


B2S 


Consequently the partition function becomes^ ? ^ ^ 

Z “ £3 ^ e ~^ +P ^ + ^ /2mB ^Ayi^APizAp 3u Ap 3X . 


(79) 


( 80 ) 


Now since the coordinates are all independent of each other and since 
the exponential term does not involve any configuration coordinates 
we can carry out the summation over the Ax„ etc., independently of 
the momenta and have 

^Ax,Ay,A z, = V, (81) 


where V is the physical volume occupied by the particles. Since the 
Apix, etc., can be made as small as we please subject only to condition 
(78), we shall assume that to a very high degree of approximation 
(h being assumed to be a very small constant) we can replace the 
summation over the p JX , etc., by a triple integration. It should be 
emphasized that this replacement of a finite sum by a definite integral 
is here purely a matter of mathematical convenience and has nothing 
fundamentally to do with the essential nature of the Maxwell-Boltz- 
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mann statistics. In the evaluation of the integral we shall further 
suppose that the limits for each variable are — oo and + oo respec- 
tively. Actually there will be finite maximum and minimum values of 
the momentum components for any actual aggregate of particles. 
However the exponential integrand renders the choice of infinite limits 
a very acceptable approximation (cf. the remarks after »eq. 30 in 
Chapter II). Now by a simple transformation and extension of eq. 
(23) of Chapter II, we have 


/•+* >•+<■ /> 
mJ — 00 •/ — 00 •/ -P 


+• 

e -(4+pi+«*>/2m «dpjp<dp. = (2«n0) ,/ *. 


(82) 


Hence v 

Z = • (2 1 rw0)‘ / *. 


(83) 


This is indeed of the form of eq. (75) in the last section and thus leads 
to the correct equation of state for an ideal gas by substitution into 
eq. (73). From the form of the partition function for an ideal gas as 
obtained in (83) it results that the free energy (72) is an extensive 
quantity, i e., at given temperature it is directly proportional to the 
number of particles and does not depend directly on the volume of the 
system. In fact if we substitute from (83) into (72), the result for 
the free energy of an ideal gas is 

.-«r[*,(r.a^) + i]. <m> 

If two systems are combined so that the density and temperature 
remain the same the free energy of the combination is the sum of the 
free energies of the individual systems. It is again assumed that the 
constant k is chosen correctly. 


5. EQUIPARTITION OF ENERGY IN A SYSTEM OF FREE PARTICLES. 

ENTROPY OF AN IDEAL GAS 

We can at once extend the results of the preceding section in an 
interesting fashion to obtain an expression for the average energy per 
particle of the system. If we differentiate 

Z = Zng,e~ E,/B = • (2irr»0) ,/ ' 

with respect to 0 we obtain 

i - g (W A e' A . 


(8S) 
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Hence (eq. 30) 


Hg ] E J e~ B,/e 

E ~ 



( 86 ) 


This significant result gives the average energy directly in terms of the 
distribution modulus. It is possible to interpret the equation as express- 
ing the equipartition of the kinetic energy among the various degrees 
of freedom of the individual particles. Each of the latter has three 
degrees of freedom. If we assign each degree of freedom on the aver- 
age the amount of energy 0/2, the total per particle is 30/2, which 
is the equivalent of (86). As a matter of fact this interpretation is 
confirmed by computing the average energy per particle for the com- 
ponent motion along the x axis only, which at once yields 0/2. From 
the symmetry of the integrand in (82) the same result will follow for 
the y and s degrees of freedom. We reach therefore the general con- 
clusion that on the average the kinetic energy of an aggregate of free 
particles in a canonical distribution is distributed equally among their 
degrees of freedom. It is important to emphasize that the result (86) 
depends for its validity on the approximation of sums by integrals. 
We shall see that this is true for any statistics in which discrete cells 
or boxes are employed. The situation is rather different in the method 
of Gibbs, which will be treated in Chapter VI. We should further 
emphasize that the system to which the equipartition principle applies 
is an aggregate of independent particles, possessing kinetic energy 
only. In any case we shall find the equipartition principle of con- 
siderable value in our future discussion of aggregates of particles. 
In particular we shall use it in connection with the kinetic theory of 
gases which will be reviewed in the next chapter. 

It will be of interest to apply eq. (86) to the derivation of the 
entropy of an ideal gas. From the definition (69') we have 


But 


5 = k log w — kN log N + kN. 
k log w = E/T + Nk log Z, 


(87) 


where E is the total energy of the gas, which from (86) becomes 
E = ZNkT/2 (with 0 replaced by kT, as before). The use of (83) 
coupled with substitution into (87) yields finally for the entropy of 
the ideal gas 

o 5 , tmT1 \V (2irmifer)’ /, l 

5 = -WV + WVlog^- • (88) 

It follows that the entropy is an extensive quantity in the same sense 
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aa the free energy, as discussed, in Sec. 4. The reader should note 
that this result would not have been obtained if we had defined the 
entropy simply as S = k log w. It is the choice of the additive con- 
stant — k log N ! which assures that S shall have the extensive prop- 
erty. 4 


PROBLEMS 

1. Derive the distribution law N, = Ng, (eq, 7 of this chapter) by varying the 
N, subject to 21V/ ■= i, while keeping the g, fixed. 

2. Find the expression for the free energy of the set of independent linear simple 

harmonic oscillators whose possible energy values are given by E, <= (j — J 4)h», 
(j ■= 1,2,3 • ) (cf. eq. 37). Do the same for a set of two-dimensional oscillators 

for which the average energy is given by (47). 

3. From the partition function Z for a set of independent linear simple harmonic 
oscillators derive the expression for the entropy 

4. Show that if the entropy is defined as 5 = k log w, it is not an extensive quan- 
tity for any ideal gas and that the same is true of the corresponding free energy 

5. Derive the expression for the Gibbs free energy, ¥ + p V, for an ideal gas. 
Do the same for the enthalpy (cf Sec. 3, Chapter III) 

6. In a canonical distribution of N free particles find the expression for the 
number of particles having kinetic energy included between E and E + dE Find 
the ratio between the number of particles with energy in the range dE about the 
average energy E = 30/2 and the number of particles with energy m the same range 
dE about an energy differing by 1 per cent from E. 

7. Compute the “ effective statistical probability ” for the canonical distribution 
of one mole of an ideal gas under standard conditions 

8. The methods of this chapter may be applied to an aggregate of independent 
particles which move in an external conservative force field characterized by the 
potential function y, s). F ind the general form of the partition function of such 
an aggregate. Specialize to the case in which the force field is the constant gravi- 
tational field near the surface of the earth 

‘This point is well brought out in Mayer and Mayer, “Statistical Mechanics,” 
pp 114 ff John Wiley & Sons, 1940 
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THE KINETIC THEORY OF GASES 

1. THE VHtlAL AND THE EQUATION OF STATE OF AN IDEAL GAS 

It is assumed that the reader is acquainted with the fundamental 
assumptions of the kinetic theory of gases from at least an elementary 
point of view. It is the intention of this chapter to review briefly 
its chief results, emphasizing in particular the points of contact with 
the statistical theory of the preceding chapter. 

The kinetic theory envisages a gas as composed of a very large 
number of material particles called molecules moving with widely 
varying velocities in all directions and colliding with each other and 
with the walls of the confining vessel. The collisions with the walls 
are assumed to be responsible for the pressure of the gas while the 
average kinetic energy of the molecules is connected with the observed 
temperature of the gas. The first fundamental task of the kinetic 
theory is to provide a theoretical deduction of the equation of state 
of the gas, i.e., the relation connecting pressure, volume and tempera- 
ture (cf. Sec. 3, Chapter 3). There are several ways of attacking 
this problem. 1 The one presented here is somewhat different from 
that used in elementary books, being based on the so-called virial of 
Clausius. 

We shall confine our attention first to an ideal gas, in which the 
molecules are mass particles in the form of elastic spheres with radii 
very small compared with their average distance apart, so that indeed 
their dimensions can be neglected. They are assumed to be free 
particles exerting no mutual forces and all having the same mass m. 
We shall suppose their number is N and number their coordinates 
x„ y«. in some inertial system with the subscript i which runs from 
1 to N. Consider the following function of the coordinates 

N 

M = % + y? + z?). (1) 

1 Cf. Sir James Jeans, “Dynamical Theory of Gases,” Chapter VI. Cambridge, 
1930. The reader may also consult with profit other recent books on kinetic 
theory, e.g., L B. Loeb, “Kinetic Theory of Gases,” Chapters II and V, McGraw-Hill, 
New York, 1927; and E H. Kennard, “Kinetic Theory of Gases,” Chapters I and V, 
McGraw-Hill, New York, 1938. 
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This does not appear to have immediate physical significance. But 
let us differentiate with respect to the time. Using the dot notation 
for time differentiation, we obtain 

AT 

M - + ysj/, + s,i,). (2) 

i-i 

A second time differentiation gives 

M + 5? + i?) + -f y,y, + ' (3) 

.-i *-i , 

We shall now proceed to form an average of a somewhat different 
nature from any hitherto considered in this book, namely an average 
over the hme. The coordinates, component velocities and component 
accelerations of the particles are of course functions of the time. 
Hence the quantity JUT will depend on the time. Its time average is 

M =* - f Mdt = - (M T - Mo), (4) 

T J 0 T 


where M 0 is the value of M at the initial time t - 0 and M, its value 
at time r. Since the system of particles is confined to a finite space, 
the x t , y t , 2 , coordinates are bounded at all times. The component 
velocities x t , ju, i t will also be bounded Consequentlythe difference 
M t — Mo in (4) is bounded and therefore as r increases M r —*0. From 
this we conclude that 

~y~ s 

+ jj) = - i Y]m(x,x t + y t y, + n,s,), (5) 

-- 1 * TT7 


where the averages are taken over a sufficiently long time. The 
quantity on the left-hand side is the lime average of the total kinetic 
energy of the system. The quantity on the right-hand side may be 
written in slightly different form if we replace mi, by Fx,, the x com- 
ponent of the resultant force acting on the tth particle. Similar 
replacement of the y and z component accelerations yields for the 
right-hand side 


0 



(x.Fx, + y t F Vl + z,F h ). 


( 6 ) 


The quantity Q was called by Clausius the virial of the system. The 
relation (5) then states that for a system in canonical distribution the 
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total average kinetic energy is equal to its virial. This is the virial 
theorem. 

Let us now calculate the virial for an ideal gas by considering that 
the force components which enter eq. (6) are merely those involved 
in the collisions between the molecules and the walls. This means 
that F Xt , etc., have non-vanishing values only for the values of *„ y„ z, 
at the walls. These forces arise because of the change in momentum 
experienced by the molecules in their reflection from the walls. It 

might seem difficult to compute 
them; but we must recall that 
it is the time average which is 
involved in eq. (6). The average 
wall forces can be effectively re- 
placed by the integrated pressure 
which according to the postulates 
of the kinetic theory is indeed 
the average effect of the continual 
bombardment of the walls by the 
molecules. Figure 5 1 represents 
a portion of the surface of the 
containing vessel. Let us con- 
sider the area element n dS in this surface, where n is the unit vector 
normal to the surface at dS. This element has the position vector 

r = ix + jy + kz (7) 

in the chosen system of rectangular coordinates. The average force 
exerted on the system by the element of area is a vector directed oppo- 
sitely to n and of magnitude pdS, if we denote the pressure of the gas 
by p. The components of this force along the axes are then 

— pdS cos a, — pdS cos 0, — pdS cos y, 

where cos a, cos 0, cos y are the direction cosines of n. The summa- 
tion in (6) over all the molecules is now conveniently replaced by an 
integral over the whole surface of the containing vessel. Conse- 
quently we have for the virial 

Q = ^ f (x cos a -fr- y cos 0 + z cos y)dS. (8) 

Since by definition 

n = i cos a + j cos 0 + k cos y, 
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we can transform (8) to 

n = |y*r ndS. (9) 

We can now employ the divergence theorem (often called Gauss’ theo- 
rem) in vector analysis ! to write 


Q = 2 f VtdV < (10) 

where the integral is now the volume integral of the divergence of r 
over the whole volume of the containing vessel. But 


Hence finally 


V-r 


dx dy dz 
dx dy dz 


n = %pv. 


(ii) 


From the virial theorem we therefore draw the conclusion that the 
total average kinetic energy of the molecules of the ideal gas is equal 
to ipV/2, or 

i = | PV. (12) 


Having reached this point we cannot go further toward the actual 

equation of state without introducing a macroscopic interpretation 
± 

of E. There are two possibilities. In the first place, we may arbi- 
trarily but plausibly assume that the total average energy character- 
izes the temperature of the gas, so that constant temperature implies 

t 

constant E. With this association (12) becomes at once Boyle’s 
law for an ideal gas. If we wish to be more specific we can introduce 
the statistical considerations of the preceding chapter and assume 
that the molecules are canonically distributed with respect to their 
kinetic energy. To be sure this is passing beyond the methods of 
kinetic theory as commonly understood, but it forms a useful bridge 
between the kinetic and statistical points of view. We shall then 

j_ 

suppose that the total average kinetic energy E is the same as NE, 
where N is the number of molecules and E the average energy per 
molecule in a canonical distribution. But in Sec. S of Chapter IV we 
have shown that 

NE = %Ne. 

1 See Leigh Page, “ Introduction to Theoretical Physics," second edition, p. 32 
D. Van Nostrand Co., New York, 193S. 
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Hence (12) becomes 

pV=NQ. (13) 

But the equation of state of an ideal gas is known to be 

PV - RT, (14) 

where T is the temperature on the Kelvin scale and R is the gas con- 
stant appropriate to the gas in question and its mass. If (13) is to 
be the kinetic-statistical analogue of (14) it is dear that we must have 

e = ^r. (IS) 

In words, the modulus of the canonical distribution is directly pro- 
portional to the Kelvin temperature of the gas and the coefficient of 
proportionality is the gas constant per molecule. This is the q uan tity 
which has received the name of Boltzmann’s gas constant k, i.e., 


It 

(16) 

II 

© 

(17) 


With this assignment the kinetic statistical derivation of the equation 
of Btate of an ideal gas may be considered complete. 


2. SOME SIMPLE KINETIC THEORY PROPERTIES OF AN IDEAL GAS 


The simple kinetic theory described in the preceding section leads 
to some interesting properties of an ideal gas. We shall review these 
briefly here. 

If we write the kinetic energy in the form 


i- 1 i - 1 L 


it follows that we can express the average in 
square velocity 

v m =^? 

as follows 



terms of a root-mean- 
(18) 
(19) 


Equation (12) then becomes 

pV - iNmiZ,. 


( 20 ) 
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The value of v m can be estimated very easily from the fact that the 
density of the gas is given by 


whence 




( 21 ) 

( 22 ) 


The substitution of the atmospheric pressure in dynes/square centi- 
meter and the density of hydrogen under standard conditions (7* = 
273° K, pressure 76 cm of Hg) yields v m = 1,900 meters/sec approxi- 
mately for hydrogen, in so far as hydrogen can be considered an ideal 
gas. Actually v m for an ideal gas does not depend on the pressure. 
From (14) we readily write 



indicating that for a given gas v m depends solely on the absolute tem- 
perature. Precise knowledge of the mass of a single molecule, e.g., 
that of hydrogen, then suffices for the determination of k. Using the 
value i»h, = 3.32 X 10~ 24 gram leads to 

k = 1.37 X 10 -18 ergs/degree C. (24) 


Coming back to the ideal gas law in the form 

pV = NkT, (25) 

we observe that since A is a universal constant, N is the same for 
equal volumes of all ideal gases at the Bame temperature and pressure. 
This is the law of Avogadro. In particular since the chemical molec- 
ular weights are proportional to the actual molecular masses, it 
follows that the number of molecules in a gram molecule or mole of 
any ideal gas is a universal constant. This is the Avogadro number, 
now generally given as 

N A = 6.03 X 10 23 , (26) 


whose best evaluation is from electrochemical data, namely the value 
of the Faraday Q or charge necessary to evolve one chemical equiva- 
lent of any element in an electrolytic process coupled with the funda- 
mental electric charge e. We have indeed 

Q = N A e, 

and with Q = 9,650 emu and e = 1.6 X 10~ 2 ° emu, N A is found to 
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be 6.03 X 10 s3 . The Avogadro number is not, of course, confined to 
an ideal gas. According to atomic theory it is a genuine universal 
constant giving the number of molecules in a mole of any element or 
compound. 

No difficulty should be experienced in deducing from the above 
simple considerations other important results of elementary kinetic 
theory, including Graham’s effusion law, according to which the 
effusion velocity through small orifices of a gas at constant tempera- 
ture and pressure varies inversely as the square root of the density, 
and Dalton’s law of partial pressures that the pressure of a mixture 
of two gases is equal to the sum of the pressures which each would 
exert individually if alone in the same volume. 

The kinetic theory also has something of interest to say about 
the specific heats of a gas (Sec. 4, Chapter III). The total average 
energy per gram of an ideal gas whose molecules are single mass par- 
ticles is from (19) 


_ & = 3 kT 
2 2 m 


(27) 


Consequently the specific heat at constant volume is 

/3£m\ 3 k 

" ‘ VW) v ~ 2 m 


(28) 


Multiplying both numerator and denominator of (28) by Na gives 


cv = - 


3 kN A 


M 



(29) 


where M is the molecular weight and R m the gas constant per gram of 
gas. For the rare gas argon, for example, R m = 2 1 X 10® ergs/gram 
degree and hence (29) yields (in terms of the more familiar calories/ 
gram degree C) 

cy = 0.075 cal /gram degree C. 

This agrees very closely with the experimentally observed value at 
room temperature. 

For a gas whose molecules consist of more than one mass particle, 
we can no longer assume that the average energy per gram is given by 
(27), since die internal kinetic energy of the constituent parts of the 
molecule relative to its center of mass must be considered. We may 
account for this by writing in place of (27) 

Em-( l+0)j, 


(30) 
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where 0 represents the ratio of the average internal kinetic energy to 
the average kinetic energy of translation of the center of mass of the 
molecule. It will have the same value for all gases whose molecules 
have the same constitution. We then get 

c v = \( l+»^- (31) 


For example, if a molecule consists of two particles of equal ma« 
rotating about some axis perpendicular to the line joining them, it 
develops that 0 = % so that 


5 k 

Cy = 

2 m 


(32) 


This choice of 0 is dictated by the equipartition principle. There are 
three degrees of freedom of translation of the center of mass of the 
system of two particles whereas there are two degrees of freedom of 
rotation, namely about two mutually perpendicular independent axes 
also perpendicular to the line joining the two particles. Now if the 
equipartition principle applies also to rotational energy we should 
expect that at given temperature the ratio of the average kinetic 
energy of rotation to that of translation would be precisely %. This 
is the basis of the choice in (32) * This formula holds well for hydro- 
gen. The substitution of m g , and k into (32) gives indeed 

cy — 2.46 cal /gram degree C 

in fairly close agreement with the measured value for hydrogen at 
room temperature. 

The ratio of the specific heat at constant pressure to that at con- 
stant volume, y — c p /cy can also be handled by the application of 
the preceding considerations to eq. (25) of Chapter III The latter 
can be rewritten in terms of the actual specific heats (instead of the 
heat capacities) as follows 

Cp — Cy = R m , (33) 

it being understood that comparable units are employed on both sides. 
For monatomic gases we therefore have from eq. (29) 



This ratio is found to hold pretty exactly for the rare gases of the 

• See, for example, Kennard, op ctf , p 365, for a proof that the equipartition 
principle holds equally well for rotational as for translational energy 
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atmosphere. For diatomic gases like hydrogen and oxygen, on the 
other hand, we have from (32) 

T = i (35) 

in rather good agreement with experimental results for the quasi-ideal 
diatomic gases. As the complexity of constitution of the molecule 
increases, cy increases and we expect y to approach unity. This again 
is experimentally confirmed to a considerable extent. For the statis- 
tical theory of specific heats, cf. Chapter IX. 

3. COLLISIONS AND MEAN FREE PATH OF A GAS 

In our discussion so far we have entirely neglected the possibility 
that the molecules of a gas may collide with each other. We have 
indeed assumed effectively that the molecules are geometrical points. 
If they are actually possessed of finite extension, however, they will 
be bound to hit each other in their flight and such collisions conceiv- 
ably should have an important bearing on the properties of a gas. 

For the elementary considerations of the present section we shall 
assume that the molecules are perfectly elastic spheres of diameter D. 
Suppose that all the molecules save one are instantaneously at rest, 
whereas that one moves with respect to the others with average rela- 
tive velocity v Denote the average number of collisions per second 
experienced by any molecule by Z e - The distance between successive 
collisions is called the free path of the molecule and the average value 
of a large number of free paths is termed the mean free path. We 
shall denote it by A. Clearly, if the average velocity of the molecules 
is given by » m , we have the fundamental relation 

A = (36) 

Naturally the precise value of A for given Z e depends on the value 
chosen for the average velocity. The root-mean-square velocity is the 
one usually adopted. 

The average number of collisions per second for any spherical mole- 
cule of diameter D is the same as the average number of collisions per 
second for a single spherical molecule of radius D moving through a 
field of point molecules. Consequently we can get an approximate 
value for Z c by multiplying the average number of molecules per unit 
volume by the volume of the right circular cylinder traced out by a 
circle of radius D moving with the average relative velocity tv. Thus 

Z e = it l^v r n, 


(37) 
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where n is here the number of molecules per unit volume. In order to 
utilize (37) it is necessary to evaluate v T in terms of v m . This can be 
readily done in the simple special case for which all the molecules have 
the same velocity magnitude, viz., v m . Consider the two molecules 



whose paths make the angle 0 with each other (cf. Fig 5 2). 
relative velocity is clearly 

. 0 
v r = 2v m sin - • 


The 


(38) 


Hence 


v r = 2 iv, sin - , 


(39) 


and we must calculate the average of sin (6/2) over the whole collection 
of molecules. Now if we fix our attention on a single molecule the 
probability of finding another molecule with its velocity included in 
the angular region dO between 6 and 0 + dd from the direction of the 
first molecule is simply 

sin 6 dO. (40) 

This may be seen from the fact that we are assuming the velocity 
directions of the molecules to be symmetrically distributed so that no 
particular direction has greater a priori probability than any other. 
Hence the fractional number of molecules having their velocity direc- 
tions at any instant included within the range 0 and 0 + dd with any 
arbitrary direction is the ratio of the solid angle included between 
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0 and 8 + dd to the whole solid angle about a point, viz., 4x. This 
ratio is readily seen, however, to be 


or the value given in (40). 
average quantity 

. e 

sin - = 

2 

Note that the integration over 6 is taken between the limits 0 and x. 
We see indeed that 

8d& = 1. (42) 

The evaluation of the integral in (41) leads at once to 



2x sin 8 — , 

4x 

Hence, from the general definition of an 

Q 

- sin 8d6. (41) 



and hence from (39) 
Consequently (37) becomes 
and the mean free path is 



Vr 

z e 

X 


2 


3 ’ 



(43) 

fxZ5 a nr m , 

(44) 

3 

4x25 z » 

(45) 


This result was first obtained by Clausius. 4 The mean free path is 
thus seen to depend on the number of molecules per unit volume and 
the diameter of each. The former quantity n can readily be obtained 
from the Avogadro number and the fact that the volume of a mole of 
any ideal gas is equal to 22.41 X 10 3 cm 3 under standard conditions of 
temperature and pressure. Hence approximately at 0° C and 76 cm 
of Hg 

n = 2.70 X 10 le . (46) 

How shall we get a measure of 25? Many methods are available,* but 
we shall consider only one. This has to do with the viscosity of a gas, 
in itself an interesting topic. 

4 Cf Kennard, op. cit , p 105. 

• Cf. Loeb, op. cit. Appendix I, pp. 523 ff. 
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4. ELEMENTARY KINETIC THEORY OF VISCOSITY 

The viscosity of a gas is one of the most significant effects of the 
molecular motion within the gas on the large scale motion of the gas 
as a whole. It will be recalled that in all actual fluids the motion of 
any part of the fluid is to a certain extent resisted by the rest. This 
effect is attributed to a so-called viscous force which each layer of 
fluid exerts on the immediately adjacent layer. Newton made the 
assumption that the viscous force is directly proportional to the flow 
velocity gradient or rate of change of velocity with distance normal 
to the direction of flow. Moreover, it is also assumed that the viscous 
force is proportional to the area of the contiguous layers. The con- 
stant of proportionality is called the coefficient of viscosity, or more 
briefly, the viscosity of the fluid. If we denote the area in question 
by A, the flow velocity gradient by dV/ds, and the viscosity by v, 
the viscous force then may be written 

dV 

F =r,A- (47) 

The viscosity is thus the viscous force per unit area per unit velocity- 
gradient. In absolute units its dimensions are dyne second/square 
centimeter. In these units the value for water at 20° C is 0.01. As 
might be expected, the viscosity values for gases are very much 
smaller. That for hydrogen at 0° C is 8 4 X 10~ s in absolute units. 
The viscosity of liquids decreases as the temperature increases, but 
that of gases increases with the temperature. 

The explanation of the viscosity of a liquid is probably to be 
found in considerable measure in the cohesive forces between the 
constituent parts. This explanation is not available for ideal gases 
in which such forces are ignored. Yet even a gas like hydrogen, 
which at ordinary temperatures approaches close to the ideal variety 
in its other properties, possesses definitely measurable viscosity. 
Maxwell was the first to give a kinetic theory description of the vis- 
cosity of a gas in terms of the motion of the molecules and in particular 
the transfer of momentum by the random motion of the molecules 
from one moving layer of gas to another. An elementary discussion 
based on this idea follows. 

Consider a gas which is flowing as a whole from left to right. 
Draw the parallel horizontal planes A, P, and B (Fig. 5-3) which 
contain the direction of flow, and suppose the flow velocity of the 
gas in plane A is V\, while that in plane B is V 2 , where A and B are 
chosen a distance apart equal to 2\. The plane P is equidistant from 
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A and B. If we make the assumption that all the molecules have 
the speed v m , the average number of molecules traveling downward 
across unit area of P per second is nv m /6 while on the average the 
same number of molecules travel upward across unit area of P per 
second. Because of the way the planes have been drawn, the mole- 
cules mentioned have suffered their last collisions (before striking P) 
in the planes A or B. If we assume that in passing through A or B 
each one instantaneously acquires the appropriate flow velocity Vi 
or V 2 , it follows that the nv m / 6 downward-moving molecules convey 
from A to the gas below P the flow momentum nv m /6-mVi per second 



per unit area while the nv m /6 upward-moving molecules convey from 
B to the gas above P the flow momentum nv m / 6 mV 2 per second per 
unit area. Now from Newton’s second law the downward transfer 
of momentum per second per unit area, viz., nv m /6 mV 1 represents 
the tangential stress exerted by the gas above the plane P on the gas 
below, while the upward transfer of momentum per second per unit 
area, viz., nv m /6-mV 2 represents the tangential stress exerted by the 
gas below the plane P on the gas above. The equal and opposite 
reaction (Newton’s third law) on the gas below is therefore 
-nv m /6-tnV 2 . Hence the resultant tangential drag on unit area of 
the gas immediately below P is given by 


nv m /6'tn(Vi - V 2 ). 

By definition this corresponds to F/A in eq. (47). Moreover the 
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velocity gradient in the neighborhood of P is clearly to a good 
approximation 

dV = (7i - V a ) 
ds 2\ 

Hence the viscosity n becomes at once 


V 


nmv m \ = 

3 3 


(48) 


where p is the density of the gas. This fundamental relation has been 
derived here by a rather crude method making use of somewhat ques- 
tionable assumptions. More careful and elaborate deductions are to 
be found in the standard kinetic theory textbooks.* By substituting 
for v m from (23) and X from (45), the expression for tj becomes 


_ VZkmT 
n ~ 4xD 2 


(49) 


This brings out the interesting fact that the viscosity of a gas should 
be independent of the pressure and therefore of the density. This 
prediction was first made by Maxwell who also verified it experi- 
mentally over a wide range of values. The variation with the square 
root of the absolute temperature has also received ample experimental 
verification. For a description of experimental methods of measuring 
il, Loeb’s book may be consulted. Equations (48-49) may now be 
used to obtain numerical estimates of X, D, and Z c . Thus for hydrogen 
under standard conditions we obtain 

X = 1.7 X 10 -6 cm. 

D 10~ 8 cm. 

Z e ~10 10 sec -1 . 


5. THE MAXWELLIAN DISTRIBUTION OF VELOCITIES 

Although at the very beginning of this review of kinetic theory 
we included among the fundamental ideas the assumption that the 
molecules may differ in velocity, in the applications so far considered, 
particularly those in Sec. 3, we have actually assumed that effectively 
all the molecules move with the root-mean-square velocity v m . It 
now remains to be seen how the variation in velocity may be taken 

• Cf. Kennard, op. ctt., pp. 138 ff ; Loeb, op. ctt., pp. 180 ff, Also cf. Page, op. 
cit., p. 343. 
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into account. We must first consider how the molecules are dis- 
tributed on the average with respect to their velocities. This is the 
main problem of kinetic theory and many deductions have been given, 
the most famous being those of Maxwell and Boltzmann. From the 
purely dynamical point of view the problem is to consider an aggre- 
gate of confined elastic spheres which are continually colliding with 
each other, with the velocity of each sphere usually changing at each 
collision. However at any given instant there will be a certain number 
of spheres having velocities lying within any initially prescribed 
velocity interval. It is a fundamental assumption of kinetic theory 
that this number ultimately reaches a value which does not vary 
with the time as long as the gas represented by the aggregate remains 
in a state of equilibrium. The problem is to find this number. Boltz- 
mann attacked it from the standpoint of the average effect of the 
elastic impacts on the velocities of the individual particles. Maxwell 
disregarded impacts altogether and considered the distribution of 
velocities from the standpoint of pure probabilities. There has been 
much argument about these deductions. We shall not go into either 
but merely show how the distribution they arrived at more or less 
immediately emerges from the application of the classical statistics 
of Chapter IV to an aggregate of particles, with the assumption that 
the 6tate of equilibrium is represented completely by a canonical 
distribution with respect to the energy of the particles. The distribu- 
tion of velocities is then obtained at once from the canonical distri- 
bution, eq. (27) or eq. (27') of Chapter IV with Ej as the kinetic 
energy of a particle of mass m and velocity components v Xj , v Vj , v tj . 
In accordance with eq (1 7) of this chapter we set 0 = kT. For the 
partition function 2#ig ; e~' Bl/e we use the expression for Z given in 
eq. (83) of Chapter IV . The a priori probability g, is that given in (79) 
of Chapter IV. With these substitutions, the number of molecules 
having their velocity components in the interval v x , v x + Av x ; 
Vyi v v + Av„; v t , v, + Av t may be written 

ANs=N (^f) e~ m/2kT Av x Av y Av t . (50) 

This is the Maxwellian velocity distribution formula. If we denote 
(m/2vkT ) e m ' /,/2kT (with v 2 = r* -(- + v]) by f(v), we have 

AN „ 

— =f(v)Av x Av y Av x . (51) 

The function f{v) is the well-known probability or Gauss error func- 
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tion already introduced in Chapter II (cf. Fig. 2-1). The fractional 
number of molecules per unit velocity interval in the neighborhood 
of v is a symmetrical function of the resultant velocity, with the 
result that the average velocity is zero. 

Often a more useful type of distribution law than (51) is that for 
the fractional number of molecules having resultant velocity magni- 
tude in the interval from v to v + At/ This distribution thus disre- 
gards the direction of the ve- 
locity. If we imagine a momen- 
tum space in which each point 
represents a possible resultant 
momentum value for a mole- 
cule, the points corresponding 
to molecules with resultant 
momentum magnitudes between 
mv and mv + mAv will lie in ' p IG s 4 

a spherical shell of radius mv 

and thickness mAv, with volume 4irt» 8 i> 2 At>. Denoting by {AN)' the 
number of molecules corresponding to this volume gives 



(AN)' 4irm 3 v 2 Av 


AN mrAv x Av y Av t 


(52) 


By substitution for AN from (50) the desired distribution formula 
then proves to be 




m 


N 


\2ir kT/ 


_YV 


mt*/2kT a„ _ 


Av = 4>(v)Av. 


(53) 


The plot of 4>(v) yields the non-symmetrical curve shown in Fig. 5-4. 

We can use the non-symmetrical Maxwell distribution to calculate 
some average values of the velocity. First we shall compute the 
velocity corresponding to This will be denoted by v* and is 

d<(>( v) 

obtained by solving = 0. We get 


dv 


v * - 


2 kT 
m 


( 54 ) 


Next, the average of the absolute value of v is given by 
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On employing partial integration there results 

UTT - — 7 = v* ~ 1.13 v*. (55) 

We have already introduced the root-mean-square velocity as the most 
important average velocity in kinetic theory. The evaluation of p 8 
from (53) yields 

v a = f i?4>(v)dv = —— , (56) 

Jo tn 

with 



in agreement with the result of the simple theory at the beginning of 
this chapter. We have indicated in Fig. 5 4 the fact that 

Vm > | V | > V*. 

In practically all subsequent applications we shall employ p m . 

The average kinetic energy can also be found from (56) with the 
usual result, i e., 

■Ekm = ikT. (58) 

Many attempts have been made to verify the formula (53) experi- 
mentally. One of the more recent ones is that of Zartman, 7 who 
studied the deposition of evaporated bismuth atoms on the inside of 
a revolving cylinder and obtained a “velocity spectrum” in good 
agreement with the Maxwell law. 8 

6. APPLICATION OF THE MAXWELL DISTRIBUTION TO COLLISIONS 

The discussion of collisions and mean free path given in Sec 3 
can now be generalized to include the assumption of the Maxwell 
distribution. Let us consider the collisions between the set of mole- 
cules having velocity components in the interval v x , v x + dv x , etc , to 
be denoted as set 1 for convenience and the set having velocity com- 
ponents in the interval V x , V x -)- dV x , etc., to be denoted as set 2. 
The relative velocity of a molecule in set 1 with respect to a molecule 
in set 2 has the magnitude 

»r = V(V t - v x ) 2 + (V v - v v ) 2 + (V, - v,) 2 . (59) 

'Phys Rev 37, 383 (1931). 

• For a discussion of these and other similar experiments, the reader is referred 
to Kennard, loc. cit., pp. 71 f. See also “Atomic Physics,” Umv of Pittsburgh 
Staff, second edition, p. 11, 1937. 
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The average number of collisions per second which any single molecule 
of set 2 makes is (from Eq. 37) 


-f-® 

Z! e - rD 2 nv~ r = it D 2 n ^ SSS Vre ~ m * / 2 kT d v *d v iA v 


(60) 


where the average is taken over ail the molecules in unit volume by 
allowing v x , v y , v, to take on all possible values Evidently Z'„ is a 
function of V x , V v , V,. What we wish is the average number of col- 
lisions suffered by a molecule of any velocity. To get this from (60) 
we must multiply by the probability of finding a molecule of set 2 
and then integrate once more over the velocities. Thus the desired 
collision rate is finally 


= rD 2 n 


(=)'//////“- 


m(,« + W2 kT 


dv x dv v dv l dV x dV y dV„ (61) 


To evaluate the integral (61) we introduce the following transformation 


(62) 


a 

2 ’’ 

„, = « + - 

b 

2 ; 

b 

V * = P + 2 

c 

, c 

2 1 

V t = 7 + - 


Then 


v T = V a 2 + b 2 ■+ r. 


and substitution into (61) yields 


+ «0 


z - ffffff'" + * + A 

— 00 

e -(m/kD (a«+ 3 *+ 7 *) e -(’'‘/«r) &+»+<» .dadpdy dadbdc, 
where we have used the fact that 


(63) 


dVJVJYMj*, = ^ dod ^ dadUc - (64) 
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»{v., •••) 

d(a, ■ • ) 


is the Jacobian of the transformation, i.e. f 

dV x dV y d V„ dv x dVy dv, 

da da da da da da 


d(V x ,---) = 

d{a, • •) 

dV x dVy d V, dv x dv y dv, 

dc dc dc dc dc dc 


(65) 


In the present case this reduces to unity, leading to (63) To evaluate 
(63) we note first that 


JJJ e- (m/kT) dadfidy = (~^f • 


There remains the triple integration over a, b, c. We transform to 
spherical coordinates, i e., 


whence 

and 


a = w sin 6 cos <t>, b = w sm 0 sin <f>, c = w cos 8, 

a 2 + b 2 + c 2 = w 2 , 
dadbdc = w 2 sin 6 dd dtfr dw. 


This yields 
+ 00 


f f f ()dadbdc = f f f w 3 e~ sin 6 dO d<t> dw 

JJJ Jq Jq Jq 


<UT\ 
m ) 

and therefore the final result is 


= 2^ 


. 


= 2y/2^ D 2 n = 4 yjj D 2 m>, 


( 66 ) 


This value of Z c should be compared with that in eq. (44) which we 
obtained on the assumption that all the molecules move with the 
velocity v m . Since x/3 is very close to unity, the difference between 
the two formulas is slight, amounting in fact to only a little over 
2 per cent. While it is decidedly unsafe to generalize on the basis of 
a single illustration, this result does suggest that the invocation of 
the Maxwell distribution does not alter the general functional form 
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of simple kinetic theory formulas but may be expected merely to 
introduce a slightly altered numerical multiplying factor. This pre- 
sumption is realized to some extent by more extensive study, though 
the change in the factor may indeed be larger than in the case just 
discussed. 

The value of the mean free path obtained from (66) turns out to be 


X = 


V^3/t r 

i * 


(67) 


differing very slightly from our more simple version in eq (45). 


7. TRANSPORT PHENOMENA 


The phenomenon of viscosity as described by the kinetic theory 
of gases is an illustration of a group of effects known under the head- 
ing of transport phenomena. Other well-known illustrations are heat 
conduction and diffusion. It is not our intention to give a detailed 
discussion of these effects which are usually extensively treated in 
the standard kinetic theory texts * However, it will be worth while 
to discuss them qualitatively and give for reference some of the funda- 
mental formulas, particularly with a view to the influence of the 
Maxwell distribution on the results 

We have already mentioned viscosity in Sec 4 and have given 
a “ derivation ” of a sort for the coefficient of viscosity of a gas. The 
application of the Maxwell law to the problem is a rather involved 
matter but finally leads to the result 

ij — 0 310pu m X, (68) 


differing from the value in eq (48) by the difference between % and 
0.310. Actually the problem proves to be more complicated than 
even the Maxwell distribution would make it; in fact account must 
be taken of the fluctuations which occur from the precise Maxwell 
distribution and which exert a considerable influence on the viscosity 
which tends to increase the numerical factor 10 in (68) to nearly 0 5. 

The conduction of heat through a gas, like that through a solid, 
takes place whenever a temperature gradient exists The thermal 
conductivity k is defined by the fundamental equation 


dQ _ _ dT 
dt dx ' 


(69) 


’ Cf , for example, Kennard, op at , pp 135 ff. 

10 This will be true only for an infinitely rare gas Thi9 result is due to Chap- 
man. For the reference, consult Kennard, op ctt , p 147. 
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where dQ/dt is the rate of flow of heat per unit area per second in 
the x direction and dT/dx is the temperature gradient in this direc- 
tion. From the standpoint of kinetic theory thermal conduction is 
due to the transfer of the energy of the on-the-average faster moving 
molecules from the high temperature region to the lower temperature 
region. Of course molecules from the low temperature region also 
move to the higher temperature region but they carry less energy 
and hence there is a net transfer of energy in the direction of decreas- 
ing temperature. We can even give a simple theory of thermal con- 
duction by considering the transfer of energy from one layer of a gas 
to another just as we treated the transfer of momentum in the 
simplified discussion of viscosity in Sec. 4. 

Let us refer once more to Fig. 5-3 and assume that the average 
kinetic energy per molecule in plane A is E\ while that in plane B 
is E 2 . By utilizing assumptions similar to those employed in Sec. 4, 
we arrive at the conclusion that the nv m /6 molecules which cross 
unit area per second moving downward from A, transfer the kinetic 
energy nv m /6-Ei, while those moving upward from B, transfer the 
amount nv m /6-E 2 . There is thus a net transfer of kinetic energy of 

^ (Ei~E 2 ) 

over a distance of 2X. But we can write 


Ei — £2 dE 
2X = Is ’ 


where dE/ds is the gradient of kinetic energy in the direction of tem- 
perature change. The rate of flow of heat energy per unit area per 
second then becomes 


dQ nv m \ dE 

dt 3 ds ’ 


(70) 


where the negative sign emphasizes that the flow takes place along 
the negative gradient. Now we can get a connection with (69) by 
observing that 


dE _ dE dT 

(71) 

ds ~ dT ds' 

1 dE 

(72) 

k = - nv m \ — 

3 dT 


whence 
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But « — is the specific heat of the gas at constant volume per unit 

volume and divided by the density is the ordinary specific heat cy 
(eq 28). Hence for an ideal homogeneous gas this simplified theory 
yields 

* = = l\Cy. (73) 

This is an extremely interesting result, connecting as it does the 
thermal conductivity with the viscosity and specific heat at constant 
volume. All these are independently measurable quantities so that 
the relation provides a good test of the success of kinetic theory. 
Since the derivation is idealized and all intermolecular action is ignored, 
we cannot expect indeed that (73) will be satisfied exactly for any gas 
though there should be agreement in order of magnitude. A table of 
experimental values quoted by Kennard 11 indicates that the ratio 
k/hcv for the rare gases helium, neon, and argon averages about 2 45. 
For hydrogen, nitrogen, and oxygen the ratio is around 2. It is inter- 
esting to note that for the gases with more complicated molecules like 
NH 3 , the ratio drops to approximately 1 5. It is clear that the rela- 
tion (73) has only order of magnitude validity and further study is 
necessary to establish an accurate relation. This has been done by 
Chapman (cf. Kennard, as before, for references) who found that for 
monatomic gases (73) should be replaced by « = 5/2 *)Cy. This 
is in good agreement with experiment. On the other hand for poly- 
atomic gases Eucken has derived the formula k = 1/4 -(97 — 5)t)Cy 
(with 7 = Cp/cy) which is also in excellent agreement with experi- 
ment. 

Diffusion takes place in a gas composed of two or more different 
kinds of molecules where the concentrations vary from point to point. 
The result is an eventual evening out of differences of composition 
with the ultimate attainment of uniform concentration throughout 
the gas. Kinetic theory describes this process in terms of the greater 
probability of molecules of a given kind to move from a region of 
high concentration to one of low concentration than in the reverse 
direction. This, of course, tends to wipe out differences in concen- 
tration. Diffusion may be described quantitatively in terms of the 
diffusion coefficient. Suppose the gas contains two different kinds 
of molecules with concentration rij and (number of molecules per 
unit volume). These are functions of both space and time. An 
analysis of the problem, into which we shall not go, shows that for 


11 Op. tit., p. 180 
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an ideal gas the variation of «j and n a with / and x,y,z is expressible by 
the fundamental differential equations 


dttj 

at 


DV 2 n lt 


drut 

at 


DV a n a . 


(74) 


In these equations D is called the diffusion coefficient for the mixture 
of the two gases. It has the dimensions of square centimeters/second. 
The reader will note the interesting resemblance between eqs. (74) and 
the ordinary differential equation for heat conduction, obtained from 
the definition eq (69). Thus if T is the absolute temperature 


dT 

at 


— • v 2 r. 

pcy 


(75) 


A simplified theory of diffusion due to Meyer leads to the follow- 
ing equation connecting D with the mean free path 


-( n2Vm1 ^ 1 W l p w 2^2 

3 \ »i 4* n a 


(76) 


where the subscript 1 refers to the molecules of the fu-st kind and 2 
to those of the second kind. It must be noted that the mean free 
paths in this formula are those given in terms of the root-mean-square 
velocity. The formula (76) takes no account of the Maxwell distri- 
bution nor of the more recent Chapman theoretical considerations. 
Kennard’s book should be consulted for their effect. 


8. TH£ BOLTZMANN //-THEOREM 

In Sec. 5 we introduced the Maxwell distribution of velocities 
of the molecules of a gas as a kinetic representation of the canonical 
distribution of particles in an aggregate with respect to their kinetic 
energy. This was based on purely statistical considerations in accord- 
ance with which the canonical distribution and hence the Maxwell 
distribution is the most likely one because there are more ways 
in which it can be realized than any other. As such we took it to 
correspond to the state of equilibrium of the aggregate. From the 
standpoint of pure kinetic theory this might appear to be a somewhat 
gratuitous assumption. When we realize that the molecules are in 
the continual process of changing their velocities by collision, how 
can we be sure that any steady distribution of velocities will ever be 
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obtained, or that if such does occur the distribution may not actually 
be different from the Maxwellian? It is worth pointing out that 
these questions do not arise in a strictly statistical theory. Never- 
theless they are fundamental in kinetic theory. They were answered by 
Boltzmann, 18 and the answer is included in his celebrated //-theorem. 
Boltzmann investigated the quantity 

*-/ fiogfdk, (77) 

where 

/ = f(.Vz,v v ,v„t) (78) 


is the velocity distribution function, e g., the Maxwell f(v) in our 
eq. (51) (divided by the constant {m/2xkT) l,t to make it non-dimen- 
sional) and dk = dv x dv y dv t . The integral in (77) is a triple one with 
the limits from — oo to + oo. We have indicated in (78) that / may 
be a function of the time; it is not, of course, in the Maxwellian case. 

Boltzmann proceeded to form dll/dl from (77) in terms of df/dt. 
The latter quantity can be expressed in terms of / itself by means of 
a rather general and somewhat elaborate study of the effect of col- 
lisions on the distribution function. We shall not go into this, but 
merely state that without specification of the precise form of /, it 
develops that 


"so. 

dt 


(79) 


In words, the function II cannot increase with the time. In particular 
for a steady distribution dll/dt must vanish. But from the form of 
dll/dt, its vanishing implies necessarily that f have the Maxwellian 
form. The Maxwellian distribution then is the only steady one, i.e , 
the only one which does not change with the time. This is the essen- 
tial content of the //-theorem. However, one can go on to show that 
II has a minimum value for the Maxwell distribution as compared 
with all others having the same v m , i.e., the same average energy. 
Hence we see that the effect of the collisions is to make / ultimately 
assume the Maxwellian form if at any initial instant it does not 
possess it. One could even get an idea of the rapidity with which 
the Maxwell distribution is approached by computing dll/dt. For 
ideal gases under ordinary conditions of temperature and pressure 
this rate is very rapid indeed. It can be shown 18 that if the initial 
distribution is non-Maxwelhan and the mean square velocity com- 

12 K Akad. Wiss, {Wien) Sitzungsbenchte 66, 275 (1872) 

u Cf. Jeans, op at , p. 243. 
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ponents in the three coordinate directions differ, the time in which 
the difference between any two such components becomes 1/e of its 
initial difference is of the order of 



where 17 is as usual the viscosity and p the pressure. This time was 
called by Maxwell the “ relaxation time.” Thus for hydrogen at 0° C 
and standard conditions 

t 8.4 X 10 — 11 sec. 

It will be noticed that this ts comparable with the time taken by a 
hydrogen molecule under these conditions to traverse a free path 
equal to the mean free path. It gives a good idea of how quickly 
any deviations from the Maxwell distribution may be expected to 
disappear. 

In our later study of statistical mechanics we shall find an inter- 
esting connection between the II function of Boltzmann and the 
entropy. The theorem (79) will be found indeed to have a close re- 
lation with the law of increasing entropy of an isolated system. 

9. EQUATION OF STATE OF A REAL GAS. VIRIAL FOR INTERACTION 

FORCES 

The treatment of kinetic theory in this chapter has been limited 
to the ideal gas in which forces between the individual molecules, 
aside from those arising from collisions, are entirely neglected. Much 
of the lack of precise agreement between the results of kinetic theory 
and the behavior of actual gases has been traced to this neglect. For 
example, the equation of state 

pV = NkT, 

derived from the simple kinetic theory in Sec. 1, describes a real gas 
accurately only when it is well above its critical temperature. Efforts 
have naturally been made to obtain a more satisfactory equation of 
state by taking into account molecular interaction. In spite of the 
years of investigation from Maxwell’s time to the present, this subject 
can hardly be considered to be in satisfactory condition from a 
theoretical point of view. However, in this section we present an 
introduction to the subject, adopting the method of the viriai already 
used for the ideal gas in Sec. 1. 

We shall suppose that the interaction forces in question are central. 
i.e.| depend only on the distance r X} from the tth to the jth particle 
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If tm is the mass of the ith particle and* m } that of the jth, it will be 
assumed that the force between the two may be expressed as 
Hence the * component of the force may be written as 

K = ».■»*,/(»•„)(*. - x,)/r t „ (81) 

where x, and are the * coordinates of the two particles respectively. 
We must now apply this to the computation of 0 in eq (6). Con- 
sider first simply the two particles t and j. The contribution to the 
virial arising from their mutual interaction is then 


1 

— - ftiittij 
I 


M,) 


[*»(*» — x 3 ) + x 3 (x 3 — x,) + similar terms in y and z] , 


which becomes on reduction and combination of terms 


- £ «iW,/(r„)r„ ( 82 ) 

There is an important point about (82) which deserves attention. 
Due to the symmetrical average distribution of the particles the 
average foroe on each is zero (save close to the walls of the container). 
We must be careful not to conclude, however, that therefore 
f(r l3 )r, 3 = 0. 

Summing over the whole aggregate of particles we have 
- £ ^2 m ' mj ^ r v) r o- 

where the summation is extended over ail pairs of particles If now 
we combine this with the contribution to the virial due to the surface 
forces (eq 11), the virial theorem yields 

| NkT = ipV - £ y] tn t m J f(r lJ )r l} . (83) 

*! 3 

The equation of state then takes the general form 

pV = NkT + £ ^ m<mJ(r tJ )r l} (84) 

If we were acquainted with the precise nature of the intermolecular 
forces, i.e., the/(r„), and could evaluate the sum involved in the second 
term on the right-hand side of (84) we should have a theoretical de- 
duction of the exact equation of state of any real gas. However, 
we do not know the intermolecular forces, except in so far as better 
information about them is being obtained by quantum mechanical 
studies which are not exjnsidered in the present purely classical treat- 



96 


THE KINETIC THEORY OF GASES 


[Ch. V 


ment. Of course one might try various force laws and see how the 
resulting equations compare with the empirically known equations of 
state. The difficulty comes, however, in carrying out the evaluation 
of the summation for any force law. We can at any rate draw certain 
semi-quantitative conclusions from the general expression (84). In 
the first place we can recall the tx aring of an important observation 
of Maxwell on the equation. From a certain point of view the equa- 
tion says that the pressure of a gas may be thought of as arising from 
two sources, i.e., (a) the motion of the molecules symbolized by the 
term NkT in (84) and, (6) the intermolecular forces symbolized by the 
summation term in (84). One finds it interesting to ask what would 
be the result of assuming that the pressure arises primarily from the 
forces rather than from the motions. This of course would imply 
repulsive forces between the molecules. But Maxwell observed that in 
this case Boyle’s law pV = constant at constant temperature would 
demand that f(r l} ) should vary as 1 /r tJ , for only in this way could 
£/(r, ; )r,j be made constant. But this inverse distance force law would 
lead to the result that the force action of the portion of the gas far 
away from a particular molecule would be greater than that near at 
hand. (Recall that the number of molecules in the neighborhood of a 
particular molecule goes up roughly as the square of the distance from 
the molecule ) This would produce differences of pressure in vessels 
of the same volume but different shape, even at the same tempera- 
ture — an anomaly not observed. Maxwell therefore concluded that 
most of the reason for the pressure of a gas on the molecular theory 
must be sought in the kinetic energy term and the summation term 
must be considered a correction only. 

If we consider the intermolecular forces as cohesive in nature, i.e., 
as attractive but falling off in intensity very sharply with the distance, 
we can estimate the summation rather readily as follows. 

Let us first replace m x m } f{r x} ) by 4>(r XJ ). Consider the interaction 
between the group of molecules in volume dV placed for convenience 
at the origin of a system of spherical coordinates and the group in 
volume dV' all at distance r from the first group. Let n as usual 
denote the number of molecules per unit volume and neglect to a first 
approximation the effect of the cohesive forces on the uniformity of 
spatial distribution. Then the contribution of the forces between the 
two groups to the sum in question becomes 

n 2 dVdV'r<t>{r). 

The total contribution for the whole gas will be obtained by inte- 
grating the above over d V and d V' and dividing by 2 to avoid count- 
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ing the volume twice. Let dV' = 4*-r 2 dr. We need not worry over 
the precise upper limit of the integration since <f>(r) is assumed to fall 
off very rapidly with r. Hence 


2r0(r) = 


^m; 


r 3 <j>(r)dr. 


( 85 ) 


The average required by the definition of the virial is here effectively 
implied in the procedure of taking the average number of particles per 

unit volume. The / r 3 4>(r)dr is a negative constant which we may 


call —a', while 


J' d V — V, the total volume of the gas. Hence 

„ 2irW 

2r0(r) = — (86) 


Equation (84) then becomes 


P = 


NkT 


2 iriVV 

3 V 2 


If further we let 2/3 irN 2 a' = a, a constant which does depend, of 
course, on the total amount of gas, the equation of state becomes 


O’ + fs) 7 - 


NkT. 


(87) 


When the molecules get very close together it is necessary to assume 
that the attractive forces become repulsive The calculation of the 
contribution of these repulsive forces to the virial is a more compli- 
cated matter. It is c arried out by Jeans 14 and leads to the result 
that we must add to 2 r<f>{r) the term 

4 

3bNkT 


where b has the value 


V ’ 

b = iNirD 3 , 


( 88 ) 


(89) 


the molecules still being supposed to be elastic spheres with diameter 
D. The result of (88) is to modify eq (84) still further. If b/V is 
small the resulting equation may be written 


[p + y)lV-b)-mT. 


( 90 ) 


u Op ett, p 131. Cf also Loeb, op ett , p 138. 
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This is the familiar van der Waals’ equation, which is fairly successful 
in describing the behavior of many real gases. It is interesting to 
note that b appears as equal to four times the total volume of all the 
molecules in the gas. In many discussions of van der Waals’ equation 
it is introduced as a correction to the volume of the gas to allow for 
the volume actually occupied by the molecules. In the present dis- 
cussion it arises more properly as a contribution from the very short 
range repulsive forces which must exist between the molecules. 

10. THE BROWNIAN MOTION AND MOLECULAR REALITY 

In concluding this brief survey of kinetic theory we encounter a 
question which must often have been seriously asked in the early 
days of the theory: Are molecules real? Do they actually exist and 
can we know about their existence more directly than by the success 
of the theory as a description of thermodynamical phenomena? From 
the standpoint of modern physical methodology these questions have 
little or no meaning. Nevertheless the attempt to answer them led 
to an exhaustive study of an interesting phenomenon and shed much 
light on other kinetic properties. The phenomenon takes its name 
from the English botanist Robert Brown, who in 1827, noticed the 
irregular but ceaseless motion of small particles, e g., gamboge, sus- 
pended in a liquid. The same phenomenon is also exhibited in striking 
fashion by smoke particles suspended in air. At first the motion was 
thought to be of organic origin but after the rise of the kinetic theory 
it became clear that the only reasonable explanation for it lies in the 
assumption that the particles are subject to the continual bombard- 
ment of the molecules of the surrounding medium. The most com- 
plete experimental study of the phenomenon is that of Perrin “ The 
theory has been developed by a number of investigators, including 
Einstein, Smoluchowski and Langevm. We shall present here the 
method due to Langevin 18 

Consider the motion along the x axis of a single particle in a viscous 
medium. The equation of motion is, by an extension of Stokes' law 
for the fall of a sphere through a viscous fluid, 17 

mx + 6mjax = X. (91) 

Here m is the mass of the particle, supposed to be a sphere of radius a, 
and the viscosity of the medium is r }. The X on the right represents 

u Cf “ Brownian Movement and Molecular Reality," trans. by F Soddy, Taylor 
and Francis, London, 1910 

u P. Langevin, Comptes rendus, 14<S, 530 (1908). 

17 See, for example, Page, op. ctt , p. 273. 
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the unknown force due to the molecular impacts. All we know about 
it is that it is positive as often as it is negative and sufficiently large 
to maintain motion. 

It will be convenient to rewrite eq (91) in terms of { = * - * 0 , 
which is the actual displacement of the particle in time t if x 0 is the 
initial distance from the origin. Then multiplying through by f, we 
obtain 

m'tt + 6iri jatf = X(. (92) 

We can write at once 

d 2 f 2 

w = 2 * + 2 * 

Therefore 

f~|--m{ 2 + 3 T , a ^a 2) =X{. (93) 

If next we average over a large number of identical particles, X£ 
becomes vanishingly small. We set z = £ 2 and get 

tn — 

jz + 3T1IOZ = m£ 2 . (94) 

Now if the particles can be thought of as forming an ideal gas the 
principle of equipartition of kinetic energy should apply and we can 
set mj 2 = kT. Hence eq. (94) becomes 

, 2 kT 6ir?jaz 

2 , (95) 

m m 

which on integration yields 

z = kT/3vi,a + Ce~ &rvat/m . (96) 

Here C is a constant of integration If the density of the material 
composing the particles is, let us say, about 1 2 grams/ cm 3 ; and if 
the viscosity is that of water, i e , approximately 0 012 gram/cm sec; 
and if a = 0.2 X 10 -4 cm, we have 

bir-qa/m ~ 10 8 sec -1 . 

Hence as far as any steady state is concerned we may safely neglect 
the last term in (96). That equation then becomes 

£ |2 _ tH. 

dt 4 3irija ’ 

yielding on integration _ _ 

£ 2 = kT/3vi/a-T + fo 


( 97 ) 
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for the mean square displacement along the x axis at time r, if £o is 
the initial mean square displacement. But we can always choose 
£§ = 0 and have finally 

£ z = kT/S-w-qa-T, (98) 

which is the equation for the Brownian motion first deduced by 
Einstein, giving the mean square displacement as a linear function 
of the time. As an illustration consider a typical experiment of 
Perrin in which 50 gamboge particles all with radii very close to 
a — 0 212 X 10 -4 cm were suspended in water at 20° C. The mean- 
square displacement component along one direction, which we may 
take to be the x axis, 18 was measured for the value r = 30 sec. The 
result was p — 4 5 X 10 ~ 7 cm 2 . Substitution into (98) yields a 
value for the Boltzmann constant 

k = 1.23 X 10 -18 ergs/degree C. 

This is of the correct order of magnitude and the result may be taken 
as a confirmation of the essential success of the theory and the value 
of the Brownian motion for the study of molecular phenomena. 

PROBLEMS 

1. Use the virial theorem to prove that in an aggregate of particles which are 
not confined in a box but which nevertheless have their motions bounded and attract 
each other with a force varying inversely as the square of the distance of separation 
the time average of the total kinetic energy is equal to minuB one-half the time aver- 
age of the total potential energy. 

2. One thousand molecules of hydrogen are in thermal equilibrium at 0° C 
Find the number with speeds between 0 and 100 meters/sec, 100 and 200 meters/sec 
and so on up to 3,000 meters/sec Plot the results m the form of a curve and indi- 
cate on it the positions of »*, | v | and v m 

3. In Problem 2, find the total number of molecules whose speeds are less than v m . 
What is the probability that a molecule shall have its speed lying between v* and » m ? 

4 . In Problem 2, find the total number of molecules whose component velocity 
along the x axis lies between 500 mclcrs/sec and 550 meters/sec, whose component 
velocity along the y axis lies between —500 meters/sec and —450 meters/sec and 
whose component velocity along the z axis lies between 600 meters/sec and 650 
meters/sec To what range of directions and resultant speed does this correspond? 

5. The mass of an electron is 9 X 10~ M gram. In an ideal gas composed of free 
electrons, i e , mutual interaction neglected, calculate the root-mean-square velocity 
at 0° C. Compare the pressure exerted by the ideal electron gas on the walls of the 

“Strictly speaking one ought to consider the component displacements along 
the y and z axes also. The analytical treatment is the same as above Perrin 
should be consulted for details on this and other experimental points. 
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confining vessel at 0° C with that exerted by helium gas at the same temperature, 
the number of particles per unit volume being assumed to be the same in each case 

6. Fifty cubic centimeters of a certain mixture of oxygen and nitrogen at 0° C 
contain 2 X 10® molecules of nitrogen and 5 X 10 u molecules of oxygen. Calculate 
the partial pressure due to each gas as well as the pressure exerted by the mixture. 

7. Generalize the Maxwell distribution (50) to the case of an aggregate of inde- 
pendent particles in the constant gravitational field at the surface of the earth. 
Relate the result to the well kno wn variation of density with height in the atmosphere. 

8. Carry out the evaluation of the Jacobian of the transformation (62) and prove 
that it is equal to unity. Find the Jacobian of the transformation from rectangular 
to spherical coordinates. 

9. Evaluate the mean free path for hydrogen, helium, and nitrogen molecules 
under standard conditions. Plot the mean frie path of hydrogen molecules as a 
function of the temperature from 0° C to 300° C 

10. In van dcr Waal’s equation of state (cq 90) show that if the critical specific 
volume, critical pressure and critical temperature are employed as units of specific 
volume, pressure and temperature, respectively, the equation takes the form 





it being understood that one gram of gas is in question and that p’ = p/p r , »' “ v/v e , 
and T' = T/T c . For chlorine at 10 atmospheres and T = 293° K, find the values 
of v Which value corresponds to the purely gaseous state? 



CHAPTER VI 


CLASSICAL STATISTICAL MECHANICS 

1. THE CHARACTERISTICS OF STATISTICAL MECHANICS 

The discussion of the preceding chapters on the application of the 
statistical method to physics may arouse wonder regarding the next 
step in this study. Conceivably we might develop the kinetic theory 
further. This, however, would necessarily involve the detailed study 
of forces between molecules, lying outside the purely statistical ideas. 
These matters are sufficiently discussed in professional treatises on 
kinetic theory. We have seen that a rational foundation for thermo- 
dynamics is provided by the Maxwell-Boltzmann statistics as treated 
in Chapter IV. This might appear to satisfy all our needs and render 
further investigation unnecessary save for the mere enumeration and 
solution of specific applications which after all are a part of thermody- 
namics anyway. 

The reader will certainly have noted, however, as a somewhat queer 
circumstance that whereas the kinetic theory employs mechanical 
ideas in its set-up, the Maxwell-Boltzmann statistics is essentially 
non-mechanical. Fundanentally all its results are based on the dis- 
tribution of objects or e itities of any sort in groups with respect to 
some property or properties. Moreover the entities in question are 
assumed to be independent and without mutual influence. It is true 
we have introduced a few special applications of the Maxwell-Boltz- 
mann point of view to mechanical systems consisting of free particles. 1 
Further progress would now appear to be possible in the construction 
of a new point of view in which mechanical and statistical concepts are 
welded together from the beginning in a single unified theory. The 
postulates of such a theory, with which we shall associate the name 
statistical mechanics, will necessarily appear general and abstract. 
Nevertheless we can hope that they will serve as a rational basis for 
thermodynamics in the sense that the theorems deduced from them 
will form the laws of thermodynamics and that from the results of the 

1 Moreover in Sec 9, Chapter V, we discussed the kinetic theory for a gas with 
interacting particles, but this transcends the strictly logical application of the 
Maxwell-Boltzmann statistics. 
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theory we can calculate (as averages, of course) the significant func- 
tions in the thermodynamical equations. If such a theory turns out to 
be successful, its very generality should guarantee its universal 
applicability. 

The first to found statistical mechanics in the formal sense just 
explained was J. Willard Gibbs 2 We shall refer to the method of 
Gibbs as classical statistical mechanics and it will form the subject 
matter of the present chapter. With the advent of quantum theory, 
statistical mechanics has turned in a new direction which will be con- 
sidered in detail in the latter part of this book. The fundamental 
ideas of Gibbs, however, are as important today as they were in his 
own time and no serious student of statistical mechanics can afford to 
neglect them. 

Since statistical mechanics employs in its formulation the concepts 
of advanced mechanics and in particular the canonical equations of 
Hamilton, it will be desirable to review these briefly 

2. REVIEW OF ADVANCED MECHANICS 

Consider an aggregate of n particles with masses wii, tn 2 - m n and 
position vectors rj, r 2 • r„ in some rectangular coordinate system. 
Thus, if Xj, y„ z } are the rectangular coordinates of the jth particle 

I, = ix, + jy, + kzj. 

It will be supposed that there are external forces acting on the particles; 
these are denoted by Fi, Fg • • F n . According to D’Alembert’s prin- 
ciple and the principle of virtual displacements* the motion of the 
aggregate is given by the fundamental equation 

n 

(mJj-Fj) Sr, = 0. (1) 

j^i 

Here 8iy represents a possible, i e , virtual, displacement of the jth 
particle subject to the constraints acting on the system. In general we 
distinguish between an actual displacement which a particle undergoes 
in time and a possible displacement which it might but does not 

•‘‘Elementary Principles in Statistical Mechanics,” Yale University Press, 1903. 
See also his Collected Works, Vol II, Part 1, 1930. This famous volume was once 
characterized by Henri Pomcarfe as a ‘‘little book, little read because it is a little 
hard " It ia indeed a model of conciseness. There has recently appeared a useful 
commentary on it by Arthur Haas in ‘‘Commentary on the Scientific Writings of 
J. Willard Gibbs,” Vol. II, Yale University Press, 1936 

•Cf. Lindsay and Margenau, “Foundations of Physics, p. 102. 
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necessarily have, by denoting the former as usual by dry and the latter 
by Stj. A brief examination discloses that £ has the same formal prop- 
erties as a mathematical operator that d possesses. Thus we can 
write at once 

*,-ary = | (fyir,) - *«(«£), ( 2 ) 


where vf = t } -t„ and is the square of the resultant velocity magnitude 
for the jth particle. Therefore from (1) 


Wj^y • Stj — mjtj-SXj — BT, (3) 

i-i i-i i-i 

where we have introduced T for the kinetic energy y * ^ i»*. If we 

integrate both sides of this equation with respect to the time from / 0 to 
<i, we get 


s 


Wjfj'Jrjl = f' STdt "f /" ^ ^ Fj • Sijdl. 
J<0 Jtt "t J m 1 


(4) 


Let us limit our attention to possible motions all of which have the 
same initial and final positions respectively; then Sr } will vanish for all 
j at both and t\ and (4) will become 



{BT + SF,-Sr,)dI = 


0. 


(5) 


Finally, let us assume that the forces are conservative. This means 
that a potential energy function V{x } , y„ z } ) exists such that 


Then since 


-Fi 



. .dV 


+ 


k 


dV 

dz, 


BV 



iV »V ,*V \ 

to, **' + to, ‘ y> + to, V 


we can write (5) in the more convenient form 4 



6(T - V)dt = 


0. 


( 6 ) 

(7) 


( 8 ) 


4 The mature reader will see at once the connection between (8) and Hamilton’s 
principle No discussion is needed here, however. Cf. the discussion in Lindsay 
and Margenau, op cit , pp. 128 l. 
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The quantity T — V is usually referred to as the Lagrangian function 
of the system and denoted by L. It is here a function of the coordinates 
and their velocities. 

We now find it convenient to abandon the rectangular coordinates 
in favor of generalized coordinates qi • • • qj, which may have any 
character as long as a knowledge of them as functions of the time is 
sufficient to fix the position of every particle of the system at any 
instant. Their number / is called the number of degrees of freedom of 
the system. In the case of the aggregate of n particles, free to move in 
three-dimensional space, we clearly have / = in. Any constraints on 
the freedom of motion of the system will, of course, decrease /. Now 
the q’a will necessarily be related to the rectangular coordinates, and 
we shall express the relation in the following way 

x, = *j(si • • q/)\ y, = yMi • • • qi) 

h = • • • 2/)- (9) 


The rectangular velocity components become 




( 10 ) 


From (10) the kinetic energy T can also be expressed in terms of the 
generalized coordinates and generalized velocities qi •••£/. It can be 
shown that T has the form of a homogeneous quadratic function of the 


q,. Thus 


T = 



a tk q4k, 


t.k- 1 


(ID 


where the a,* are functions of the q,. Clearly, since the potential 
energy V for a conservative system is a function of the x„ y„ Zj only, 
it will also depend solely on the q, The problem is to examine eq. (8), 
keeping in mind that L - T — V is a function of the q } and &. We 
could then obtain the Lagrangian equations of motion in the fashion 
followed in many books. Since these are not the ones useful for 
statistical mechanics, however, we proceed otherwise. 

Let us introduce the quantity p , defined by 

Pi = (12) 

d$, 

This will be called the momentum conjugate to q } . It is, of course, a 
generalized momentum component and only reduces to the ordinary 
momentum of Newtonian mechanics when is an actual velocity. It 
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will turn out to be a more important quantity for our further con- 
siderations than fa. Next we bring in the function II defined in the 
following way 



L. 


(13) 


The function H is known as the Hamiltonian function of the system. 
From its construction it might appear to be a function of the q„ q, and 
p } . Actually it reduces explicitly to a function of the q, and p 3 . For 
let us find how H changes when the q 3 , q, and p, change by dq } , dq, and 
dpj respectively. We have 

dH \Mb + 4APt ~ ^ d< b ~ ^ &!>)■ ( 14 > 

From the definition of p, in (12), the first and last terms in the paren- 
thesis cancel and we are left with 


dll 



(15) 


substantiating our statement that H is an explicit function of the q 3 
and p, only. We shall denote this by writing II in the form H(q„ p 3 ) 
which is an abbreviation of the longer form H{q\ • q j, pi pj)- 
This notation will be used extensively in what follows. 

Equation (8) now takes the form 



PA, ~ H{p„ 



dt 


0, 


which immediately becomes 


r f 


Taap, +^m, 


i-i 


i-i 




(16) 


0. (17) 


Partial integration yields 


f /.PMAt = - C / .PA % dt > 

* /<0 J-l •/lo j- 1 

since all the Sq, and dp, vanish at to and t\. Therefore (17) becomes 
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Now except for the restriction on the Sq, and hp, at to and h, they are 
otherwise completely and independently arbitrary. The only way in 
which (18) can be satisfied under these conditions is to have the inte- 
grand vanish identically for all fig/ and Spji The independence of the 
q, and p, then implies that we must have 


dH 
d P , - 


a h 

dq, 


— A- 


(19) 


These 2/ equations of the first order are known as the canonical equa- 
tions of motion, or alternatively as the Hamiltonian equations of 
motion. 

The connection between the Hamiltonian II (p„ q,) and the better 
known functions T and V becomes clear from its definition (13). Since 
V is independent of the we have 


and (13) becomes 


dT 

dq/ 

( 20 ) 

-i^- T+v - 

( 21 ) 


If we differentiate T in 
and sum, the result is 5 


Hence 


(11) with respect to then multiply by $/ 



( 22 ) 


H = T + V. 


(23) 


In other words, for a conservative system the Hamiltonian function is 
simply the total energy expressed as a function of the generalized 
coordinates and conjugate momenta. For example, for a simple har- 
monic oscillator with one degree of freedom 


H -£ + k -l 

H - 2 m + 2 1 


(24) 


* This follows at once, of course, from Euler’s theorem on homogeneous functions. 
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the mass of the system being m and the stiffness k. The Hamiltonian 
equations (19) reduce to two, viz., 


dH . . 

r q 

(25) 

dp m 

(26) 


The second equation is identical with the definition P = dL/d$, while 
the first equation is the ordinary Newtonian equation of simple har- 
monic motion. 

An important property of the canonical equations (19) is their 
invariance of form with respect to an arbitrary transformation of gen- 
eralized coordinates of the form 

Qj ~ (Mfli • • ‘ 2/). i = 1, 2 •••,/. (27) 

If we denote by T' the kinetic energy in terms of Q, and Q, and define 
the new conjugate momentum P, by 

P, - 

it develops that we still have 

a H' 

dP, " 

djr 
dQ, " 

where H' is the function of P, and Q, obtained by substituting from 
(27) and (28) into the original Hamiltonian H(p )t q } ). The fact that the 
Hamiltonian equations are written in terms of generalized coordinates, 
of course suggests this invariance property. We shall find it useful in 
the development of statistical mechanics. 

3 . PHASE SPACE 

The task of statistical mechanics is the description of the behavior 
of large scale bodies in the form of solids and fluids by assuming them 
to be dynamical systems with / degrees of freedom, where in general 
/» 1. The complete specification of the state of the system is given 
by the 2/ coordinates and conjugate momenta qi • • ■ qj, pi • ■ • pj. 
From this point of view the state is termed the phase of the system. It 
is customary to think of the 2/ quantities constituting the phase as 


r 

<V 


K 


(28) 


(29) 
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being represented geometrically by a point in a 2 f dimensional orthog- 
onal space called the phase space of the system. Each system has its 
own phase space. As the p’s and g’s vary with the time the phase 
point moves and traces out a path in phase space. This path is not 
entirely arbitrary since the p’s and g’s satisfy the Hamiltonian equa- 
tions of motion (19). 

It will be worthwhile to illustrate by a special case which lends 
itself to easy visualization even though it does not satisfy the condi- 
tion / 5^ 1 and hence is not of much importance in practical applica- 
tions: the simple harmonic oscillator for which/ = 1. Here H = p z /2m 
+feg 2 /2, where m and k are the mass and stiffness of the oscillator 
respectively, and the Hamiltonian equations become 





(30) 


The phase space is two-di- 
mensional, viz., the pq plane. 

p 

Every point of this plane 

represents a possible phase 
of a dynamical system of one ( 

\!zmE 

degree of freedom. If the V 

yfcETk ) 2 

total energy of the oscillator V 


is constant and equal to E its 


corresponding phase points 


are those lying on the ellip- _ , f 

tical path (cf. Fig. 6-1) 


II 

+ 

"*.11 

(31) 


T his ell ipse has the semi-major axis V 2E/k and the semi-minor axis 
"s/ 2 Em. At any particular instant the phase of the oscillator is repre- 
sented by some point on the ellipse. As time passes and the actual 
oscillator goes through its various phases in physical space we can 
think of the corresponding phase point as moving about the ellipse 
and repe ating this motion periodically with the period equal to 
Iry/m/k, the actual period of the oscillator; incidentally this Is equal 
to the area of the phase ellipse divided by the energy. The rate at 
which the phase point traverses the ellipse is given by eqs. (30). If the 
energy does not remain constant, the phase path will be more compli- 
cated, but if E is restricted to He between two values £i and Es, the 
path will certainly lie in the phase space between the two ellipses 
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corresponding to E\ and E 2 respectively. We shall have occasion to 
utilize this situation later. However, for the moment it will be simpler 
to confine our attention to a conservative system. It will be noticed 
that the phase curve in Fig. 6 - 1 nowhere crosses itself. No ph ase path 
representing a dynamical system can ever cross Itself ; such a situation 
would violate the fundamental characteristic of a dynamical system 
that the precise knowledge of the g's and p’s at one instant suffices to 
specify the phase for all past or future time. However, it is of interest 
to observe that since according to quantum mechanics this precise 
knowledge is never available (cf. the indeterminacy principle), infi- 
( nitely sharp phase paths have no meaning in quantum theory and 
must be replaced by paths of finite width, or phase ribbons, as they 
! may be called. In the present chapter we shall ignore this and restrict 
ourselves to the classical point of view. 

4. THE GIBBSIAN ENSEMBLE 

We are now prepared to introduce the concept of an ensemble. 
Let us imagine a collection of a great mary dynamical systems all 
having the same Hamiltonian function. 1 hey might, for example, 
be a collection of simple harmonic oscillators, all with the same mass 
and stiffness. We shall suppose that thur phases are not the same but 
are in fact distributed over a wide variety of possible phase values. If 
they are oscillators, their phases at ary instant might correspond to 
points selected at random on the phase ellipse in Fig. 6 1. The collec- 
tion in question is termed an ensemble. It must be emphasized that 
such an ensemble is a purely mental construction and has no concrete 
existence. This introduces at once an element of abstractness into 
statistical mechanics not shared by classical particle dynamics. We 
are asked to contemplate at once a very large number of copies of a 
dynamical system distributed somehow over all the phases possible for 
such a system. The copies of the system composing the ensemble will 
be termed the elements of the ensemble. 

Each element in the ensemble is represented by a point in phase 
space. For the oscillator ensemble, these points form the loci of the 
ellipse in Fig. 6 1 if the energy of the system is fixed or, if it is not 
fixed, occupy the whole family of ellipses corresponding to the allowed 
range of E. Consider the latter case. In any given area of the phase 
space there will be at a given instant a certain number of phase points 
corresponding to elements of the ensemble. This enables us to intro- 
duce the concept of phase density. About any point in phase space 
we construct a small region and take the limit of the number of phase 
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points divided by the size of the region as the latter grows smaller. 
Clearly to give meaning to this quantity the number of elements must 
be very large and distributed more or less continuously. We shall 
refer to the phase density as D. 

Consider again our illustration of the oscillator and suppose the 
ensemble consists of N elements with energies included between £j and 
£ 2 . The average density of the corresponding points in phase space 
will then be 

p-JL J* 

2 t v m (£ 2 - Ei) 

in the phase region between the two ellipses characterized by £2 and 
Ei respectively and zero everywhere else. The actual density may be 
constant throughout the region and equal to the average, or it may 
vary and be a function of p and q. For example, the density may vary 
directly with the energy, it., D = KE. Let us investigate this for a 
moment. The total number of elements of the ensemble is given in 
perfectly general fashion by 



where dtp is the element of volume of the phase space, and is given in 
general by 

dtp = dq\ • • • dqj • dpi • • dp/. (33) 

The integration is carried out over the whole portion of phase space 
occupied by the ensemble. In the present illustration dtp — dpdq and 
(32) becomes 

N = 



the integration being taken over the whole of the appropriate phase 
space, i e., the area contained between the two ellipses for Ei and £ 2 . 
We can write simply 


dtp = 2ir 



dE, 


and obtain 



(£S - -E?). 


If we speak generally instead of a specific illustration, we may say 
that D will be a function of the pi • • • p/, qi • • • qj as well as the time, 
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and the number of elements of the ensemble in dp will be • 

dN = D{p } , q,, t)d<t>. (34) 

If D does not depend on the time explicitly the ensemble is termed 
stationary. In general, however, we expect D to change with the time 
so that the phase points occupying at a given instant a certain portion 
of the phase space may be thought of as moving out of that region into 
another one with the passage of time. 

Since the element of volume dp is to play such an important role in 
statistical mechanics, it is of importance to see how it behaves with 
respect to an arbitrary transformation of coordinates in configuration 
space. Suppose we transform the phase variables 2i • • • q/, pi • • • pf to 
the system Qi • • • Q/, P\ • • Pf, where 

Qi = Qiki *••?/) ^ 

P) = PMi •“ Qj<P\ * • ■ Pf)- 

From the property of the functional determinant already alluded to in 
eq. (64) of Chapter V, we have 


dQx 


dQfdP\ • 
d(Pi 


dPf 

■ ■ Pt, Qi 


Qs ) 


d(px • • • Pf, qi ■ • qj) 


dqi • • • dqj dpi • • • dpf. (36) 


The effect of the transformation will be clear from an evaluation of the 
Jacobian, which is of the form 




d_Pi 


d Jk 



dpi " 

dpi dpi 

dpi 



dPi 

dPfdQi 

dQf 

B{Px • 

■Qf) 

dpf 

dpf dpf 

dpf 

dipi ■ 

• Qj) 

dPi 

dPf dQi 

dQf 



dqi 

dqi dqi 

dqi 



dPi m „ 

MjrOQi" 

dQf 



dq f 

dqf dq f 

dqf 


(37) 


. dQ, 

But since —— = 0 for all j and k from 1 to / the determinant at once 

opk 


* Note that D{pj, q„ t) is an abbreviation for D(p, • • • pt,g\ • • • gj, t) as usual. 
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breaks down into the product of two determinants, i.e., 



dP\ dP/ 


dQi 

dQf 

d(Pi •••<?/) 

dpi dpi 


dqi 

dqi 

d(Pi ■ • • qj) 

dPi dP/ 

dpi dpj 


dQt dQf 

dqi dq t 


Let us denote the transformed Hamiltonian by K{P t • ■ P f , Q t ■ • • Q/). 
Then from eqs. (12) and (20), since the system is assumed to be 
conservative, 


Consequently we have 



dP 3 d dK ~ d 2 K dQi 
dpk dpk dQ, dplc ’ 


which makes it possible to write 

dP, a dK dpi a 9TZ 
dpk dQ, jri d(h dpk d(t] dpk ’ 


(40) 


since the Qi are linear functions of the p’s with coefficients involving 

dQi dK 

the q’s and therefore — is a function of the q’s alone. But — = 

1 a a * a j. 


an 

dpk 

But 


dpk 

— fa and hence (40) yields 

dP, _ d([k 
dpk ~ d& 


whence 




dqk 
fa Wi 


dpk 


(41) 


dqk _ dq k _ dP, 
dQ } dQ, dpk 


The expression on the right of (38) then becomes 


dqi dq } 


dQi dQf 

dQi" dQi 


dqi dqi 

dqi dq/ 


dQi dQf 

dQ/'" dQ f 


dqf dq } 


( 42 ) 
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Now, from the general rule for multiplying determinants, if we denote 
by C t} the ij term in the product determinant, we have 


C„ 



dqt dQj 

dQ, dqt 


3Qj 

dQi 


&)*> 


(43) 


where S } , is the Kronecker symbol which equals unity for i = j and is 
zero otherwise. Hence the product determinant is just the diagonal 
determinant 

1 0 0 0 ••• 0 
0100-0 


0 0 0 0 ••• 1 


j whose value is unity. Since the functional determinant is unity we 
, have demonstrated the invariance of the volume element d4> in phase 
i space with respect to an arbitrary point transformation. 

5. LIOUVILLE’S THEOREM. THE EQUATION OF CONTINUITY IN 
STATISTICAL MECHANICS 


Let us now consider a region of phase space and the “flow” of 
phase points through it. We can think of this region as enclosed by 
a hyper-surface whose “area" element will be denoted by da. We 
then generalize the divergence theorem of ordinary three-vectors to 
the 2 f dimensional phase space. Thus for a generalized vector function 
of the phase space, V, with components V 9l , v qi ■ . • V t/ , V Pt • • • v p , 
we define the divergence of V as 



If we denote by V„ the component of V normal to da the divergence 
theorem becomes 


/«■ 


(45) 


where the "volume” integral on the right is taken over the whole 
phase region under consideration and the “surface” integral on the 
left is taken over the hyper-surface enclosing this region. Now apply 
this to the case where V is the vector with components D$i, • • • Dq n , 
Dpi • • • Dp n . Thus Dili is the rate of flow of phase points per unit 
area in the qi “direction.” The vector V may then be said to represent 
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“phase flow," and 



Vnda represents the net phase outflow from the 


phase region bounded by the hyper-surface over which the integration 
is taken. But this phase efflux must be compensated by a change in 
phase density inside the volume. In symbols 


whence from (45) 




(46) 

(47) 


Since the phase region over which the integration is extended is arbi- 
trary, the result in (47) may be written in general 


9D d(Dp,) \ 

dt fzi\ dq, dp, J 


(48) 


It will be noted at once that this is analogous to the equation of con- 
tinuity in hydrodynamics. It can be written in physically more 
significant form by carrying out the indicated differentiations. Then 


+ jA'in! + °h 

dt ^ 




= 0 . 


But from the canonical equations (19) it follows that for any j 


Moreover 


dq, + dp, 




) 


is the time rate of change of D due to the changes in the p's and q’s. 
Consequently (48) becomes 



(49) 


which says that the total time rate of change of D, the phase density, 
vanishes. This theorem, known as Liouville's theorem, means that to 
one following the motion of the phase points in an ensemble the density 
does not change with the time. At any given place in phase space, the 
density may change, to be sure, but what may be called the motional 
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change vanishes. Thus if we consider a certain region of phase space" 
containing at a given instant a certain number of phase points, during 
the passage of time these points move in such a way as to occupy an 
equal phase volume at every instant, even though the “shape" of the 
volume may alter. This result is referred to by Gibbs as the principle 
of conservation of density-in-phase? 

The importance of this theorem for statistical mechanics can 
scarcely be over-emphasized. Not only is it basic for the further 
theoretical development but it has also been used directly to simplify 
physical problems. One of these is the study of the motion of electrons 
in the earth's magnetic field which is of great significance for the 
investigation of cosmic rays. 7 8 

Gibbs also has shown how Liouville’s theorem may be used with 
little or no further mathematical manipulation to prove the invariance 
of the element of phase volume.* 


dD 

In the special case of a stationary ensemble — = 0, and the theorem 
• at 


becomes 



(SO) 


It can be shown that if D is a function of the energy E alone (50) 

.. t dD 

follows directly. Hence Liouville’s theorem will lead to — = 0, i e. 

dt 

the ensemble is stationary. The proof follows. Here 


AD _ ai> cLE 

dq,- dE dq, ’ dpj dE dp } 

But from the canonical equations 

— = — = and = ?E = 

dq, dq, ~ P} dp, ~ dp, = ** 

Hence 

and therefore (50) follows. Therefore an ensemble for which the phase 
density is a function of the energy alone must be stationary. 


7 For an alternative derivation which is purely analytical in character, cf. Lindsay 
and Margenau, op. cit., pp. 221 ff. 

• Cf. G. Lemaitre and M. S. Vallarta, Phys. Rev., 43, 87 (1933). 

* Gibbs, op. ctt., p. 11. 
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A simple illustration of Liouville’s theorem is provided by a system 
consisting of a single charged particle with charge e moving in a uni- 
form electric field of intensity F. The Hamiltonian function for such 
a particle is 

where it is assumed that the field is directed along the positive q axis. 
An ensemble representing such a system with constant energy 2?i 
would have elements corresponding 
to the phase points located on the 
parabola Ei in the pq plane (cf. Fig. 

6-2). Let us, however, imagine an 
ensemble with phase points located 
in the phase region fa between the 
two parabolas corresponding to total 
energies Ei and E 2 respectively and 
between the parallels to the q axis 
defined by the values pi and p 2 
respectively. If the phase values 
of the elements of this ensemble 
correspond to / = 0, at the expira- 
tion of time t the phase points of 
the ensemble for the system will no Pig 6 2. 

longer lie in fa. Rather they will 

have moved into the new phase region fa bounded by the momen- 
tum values p[ and p^, where 

p'l — Pi + P 2 ~ p2 “h M 

From the canonical equations, p = eF. Now the phase “volume” 
occupied by the ensemble at t = 0 is the area 



fa 


(E 2 - Ei) 

eF 


{p2 ~ Pl)t 


while the phase “volume” occupied by the ensemble at time t is 


* - 


But from the expressions for p\ and p\ it follows that p\ — p\ = 
p 2 — pi and hence fa = fa. As time passes the same set of phase 
points occupy equal “volumes" of phase space. Therefore the density- 
in-phase must remain the same, as Liouville’s theorem requires. The 



118 


CLASSICAL STATISTICAL MECHANICS 


[Ch. VI 


reader will note that the shape of the successive phase regions occupied 
by the ensemble changes though the “volume" is invariant. 


6. THE FUNDAMENTAL POSTULATE OF STATISTICAL MECHANICS 

It is now necessary to provide some connection between the 
ensemble, which is a purely abstract mental construct, and the dynam- 
ical system whose behavior the ensemble has been created to describe. 
We do this by means of a fundamental postulate, the statement of 
which, however, demands the introduction of an additional concept. 
This is the phase probability. If we have N elements in the ensemble 
and the phase density is D, the probability of choosing at random an 
element whose phase is included in the region dp about the phase point 
Qi •• • g/, Pi ‘ Pj, is defined to be 

Pdp = D/N-dp, (51) 


where P(q ]t p Jt t) is called the probability coefficient, and we obviously 


have 


Evidently from (32) 


P(pji ffn 0 ~~ 


DiPj, gj, 0 

N 


f Pdp = y* Ddp = 1. 


(52) 

(53) 


That is, the probability of choosing an element at random from the 
whole phase space of the ensemble is unity. 

The basic postulate now runs as follows: The probability that at a 
given instant, t, the physical system being described shall have its 
phase included in the region dp about the value qi ■ ■ q/, pi • ■ • pf is 
the same as the probability Pdp oi choosing at random from the 
ensemble corresponding to the physical system an element included in 
the phase region dp. It is essential here once again to distinguish 
between the actual physical system which assumes its possible phase 
values one after another as time passes and the imaginary copies which 
compose the ensemble, whose elements correspond to all possible phase 
values of the system visualized simultaneously. 

definite physical meaning for the average over the ensemble of any 
physical quantity characteristic of the system being described. Thus 
if x (qv • • ?/. Pi • • • Pf) is a physical quantity which is a function of 
the q’s and p’s of the physical system, we shall define the average of x 
over the ensemble as 

X = J xPdp, 


(54) 
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the integration being taken as usual over the whole phase space occu- 
pied by the ensemble. From the fundamental postulate it now follows 
that x is the value of the quantity x which we should expect on the 
average to measure for the system when it is in a state of equilibrium, 
viz., the only state in which we can contemplate the act of measure- 
ment. The calculation of average values over the ensemble will then 
be the principal task of statistical mechanics, for these averages are 
taken as the actual physical values encountered in measurement and 
entering into the laws characterizing the experimentally observed 
behavior of the system. The essential difference between the average 
over an ensemble and the kind of average considered in the statistical 
distributions of Chapter IVjhould be carefully noted. Thus, for 
example, the average energy E in eq. (30) of Chapter IV refers to the 
average energy per particle in an actual physical aggregate of particles 
canonically distributed. The average defined by (54) is on the con- 
trary an average of the quantity in question for the whole physical 
system described by the ensemble. This point will be illustrated and 
emphasized again in the study of special types of ensembles (cf. the 
end of Sec. 9 of this chapter). 


7. THE MICROCANONICAL OR ENERGY-SHELL ENSEMBLE 

It is now necessary to consider some special types of ensembles, for 
clearly the fundamental postulate of the previous section can not be 
applied unless we have a definite distribution of elements to which to 
apply it. Let us consider the ensemble whose elements have energies 
lying in the interval from E 0 to E 0 ■+■ A E, for which in other words 

Eq $ H(q i qj, pv ’• Pi) ^ -Eo + AE. (55) 


This interval constitutes in phase space what may be called an energy 
shell. We shall assume that the phase points of the ensemble fill this 
shell everywhere densely. The ensemble itself is usually called a 
microcananical ensemble, a name whose significance will be understood 
better later. As a matter of fact it might more appropriately be 
termed an energ^-sHell ensemXAe. 

Now from Liouvillc’s theorem dD/dt - 0; if further we suppose 
that the ensemble is stationary, i e , dD/dt — 0, it follows from (50) 
that there is no change in phase dcns.ty along any phase 
along every phase curve the density remains constant both in space 
andtime From this we infer that D has the same value throughout 
theenergy shdl. To be sure, in order to reach this conclusion, we have 
made a £dt hypothesis, namely that every possible phase curve of the 
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system fills the shell densely or at any rate that each such phase curve 
passes infinitely close to every point of the shell. This is the cele- 
brated ergodic or quasi-ergodic hypothesis. It has received much 
attention in discussions of statistical mechanics. It is not difficult to 
show that the strict assumption that the phase curve passes through 
every point in the energy shell or on the energy surface leads to logical 
contradiction. The strict ergodic hypothesis is therefore usually 
replaced by the less stringent quasi-ergodic hypothesis that ultimately 
the phase curve passes as close as one requires to every point in the 
shell. Applied to the actual physical system the hypothesis effectively 
means that the system will ultimately get infinitely close to every 
possible phase value consistent with the restriction to constant energy 
or limited energy variation. So stated, the assumption has a plausible 
ring. The purely mathematical problems which it poses, however, are 

considerable. For a consideration of these 
the reader is referred to P. S. Epstein.*® 
Let A be the phase volume of the 
energy shell. What we wish to do is to 
specialize eq. (54) so as to obtain a con- 
venient expression for the average of any 
function of the phase over the micro- 
canonical ensemble. We construct the 
diagram shown in Fig. 6 ■ 3 where a por- 
tion of the energy shell is shown sche- 
matically. Here da represents a portion 
of the energy “surface” defined by 
H (pi ■ • -p/, qi - •• q f) = E 0 and A $ is the “normal” to this surface 
which extends to the other bounding surface of the phase volume 
occupied by the ensemble, viz , H = Eo + AE. Then for the element 
of phase volume of the ensemble we may write 



where further A<f> = / 
We now get ^ 


d<t> = As -da, (56) 

d<f> over the space between the energy surfaces. 


since J' Dd<)> — N, where N is the total number of elements in the 
ensemble, and we can take D outside of the integral sign because of 


D d<b 

Pd* = N'd+ = T4,’ 


(57) 


l# " Commentary on the Scientific Writings of J. Willard Gibbs,” pp. 465 ff. 
Yale University Press, 1936. 
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its constancy. But now we can write to an approximation which 
improves as AE grows smaller 

AE , , 

— = |V//|, (58) 

where | Vi? | is the absolute value of the generalized gradient of the 
Hamiltonian, viz., 

Its value is of course taken at the energy surface. The volume A<f> of 
the energy shell depends on E and AE. In fact we can write 


As a result of Eqs. (56-60) 


Atj> = u(E)AE. 


Pd<t> — 


w{E)\vH\' 


This enables us to express the average value of any phase function 
x(qi • • q/, pi • • pj) over the microcanon ical ensemble in terms of 
an integral taken over the inner surface of the energy shell. Thus 
from (54) 

(X) “° =/« (62) 
Before going further with the general treatment, let us specialize 
the above to the case where 
the dynamical system is a | 

simple harmonic oscillator. 

This enables us to visualize ^ N, \ > \ 0+ ^ 

a little better what we are / f \ 

doing. In Fig. 6 4 we re- — T~T ) ) ^ 

present the energy shell for ( 

the ensemble as the plane xy 1 — - 

area between the two el- * 

lipses whose equations 

are H = p 2 12m + kq 2 / 2 = Fig 6 4 

Eq and p 2 /2m + kq 2 / 2 = 

Eq AE. The element of “surface" da of the shell now becomes the 
element of arc of the ellipse, viz., 


.E.+ AE 
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(64) 


Moreover 

u(E) is obtained from the dependence of 0 (here area ) on the energy. 

Thus since the area of the ellipse for E 0 is 2r Eo, the size of the 
shell is given by 


A<j> 


AE, 


(65) 


whence w(E) — 2ir m/k = constant. We are now prepared to write 
Pdi>. Thus 

da = 


fi+=2£-« 


and 


Hence 


da 

1^1 


r 

m dq 


>/ 2mE 0 — mkq* 


dq 


Pd<f> = -7- \ r mk —> — - 

2ir \'2mEo — 


( 66 ) 


It is interesting to observe that Pd<t> reduces precisely to dt/to, where t 0 
is the period of the oscillator and dt is the time spent in the interval dq. 

As a check on (66) we can show by direct integration that J~ Pd<t> 

equals unity when the integration is extended over the whole shell. 
The evaluation of the microcanonical average kinetic and potential 
energies is of interest. Thus 


(a 


'2E 0 /k 


p 2 dq 


In similar fashion 


( 


■5 sjT* V 2mE 0 — mkq 2 


kf\ _ Eo 

2 / mo 2 * 


Eo, 

2 ’ 


(67) 


( 68 ) 


Thus for the simple harmonic oscillator the averages of the kinetic and 
potential energies over a microcanonical ensemble are each equal to 
one-half the characteristic energy of the ensemble. This reminds us of 
the ordinary mechanical theorem that the time average of the kinetic 
energy of a simple harmonic oscillator is equal to the time average of 
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the potential energy. The connection here is indeed an illustration of 
the basic postulate discussed in Sec. 6 

It will be observed that in the above example we can write p 2 /2m as 

• ^ jjrj 

- — — and kg^/2 as - — . This suggests a possible generalization as 
L op 2. dq 

follows: Let us form the average of the quantity 


Then 


dH 

X = Qk — • 

dqk 


i r an dc 

U (E 0 )J qk dq k |VH|’ 


where u(E 0 ) has been removed from under the integral sign since it has 
the same value at all points of the energy surface 
Now 




may be interpreted as the cosine of the "angle” between the normal 
to the energy surface and the q k "direction.” This follows from the 

fact that in analogy to the ordinary theory of surfaces — — • • — — , 

an an ... dqi 8qf 

— • • - — are proportional to the “direction cosines” of the normal 
dpi dp f F ^ 

to the surface 

H(pi ■ • P/,qi • • • qj) = E 0 . 

dll da . , 

Consequently we may look upon as tae projection of da on 

the 2/-1 dimensional "plane” normal to the g* direction. Therefore 

dH Jo_ 

^ dq k \VH\ 

may be interpreted as the “volume element” d<f>* of the 2/ dimensional 
phase volume enclosed by the energy surface. The same will hold true 
of 

dH da 
^apklvn]' 

Hence for both we shall have 

(x)mo = ^ f d4> * * (70) 
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Here 4>* is the total phase volume enclosed by the energy surface. 

dH 

We can make an immediate interesting application. For pk - — = pkik, 

dpi 

from the canonical equations (19), and = H + L = 2 T, where 

T is the total kinetic energy. Hence we may refer to as 27*, or 
twice the kinetic energy associated with the /bth degree of freedom. 
From (70) we therefore have 


(Tk) m = 


4>* 

2«(£o)‘ 


(71) 


But since the right-hand side is constant, this implies the equipartition 
of kinetic energy among the degrees of freedom of the system. 
dll 

Consider further g* — = — Qkpk (from 19). If we treat pk as the 

3gfc 

generalized force associated with the kth degree of freedom, eq. (6) of 
Chapter V suggests that we may define 

0--i(S SSL (72) 


as the generalized virial of the dynamical system under consideration, 
since it is formed in precisely the same way as the virial of Clausius for 
an actual physical system of particles. But from the theorem (70), it 
follows that 


0 = (S T k ) m 


(73) 


or the generalized virial is equal to the microcanonical average of the 
kinetic energy of the system. This is the statistical mechanical 
version of the virial theorem. 

The reader will find it instructive to verify this result for the simple 
harmonic oscillator. 

8. COMPONENT SYSTEMS 


The physical system represented by the ensemble has so far been 
thought of as a unitary whole. We now wish to resolve it into a num- 
ber of component systems assumed to be independent of each other, 
save for the possibility of energy exchange. Thus if the original system 
is the aggregate of all the molecules composing a given mass of an ideal 
gas, a component system might be an individual molecule. Denote the 
resultant system by 5 and its k components by Si, 5 2 , • • ■ 5*. If S,- 
has/, degrees of freedom and / is the total number of degrees of freedom 
of S 
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It should be clear that the same phase space cannot serve for the com- 
ponent and resultant systems. For the resultant system we shall 
employ what is called a y space, for the component systems the spaces 
will be denoted by mi» a *2 ' • • n k- The phase space n, has 2/y dimen- 
sions. If we are dealing with molecules in the form of particles with 
three degrees of freedom, each /x space has six dimensions, and if 
there are N molecules in the resultant system, the y space has 6 N 
dimensions. 

Though we shall conceive the system S to be closed and hence 
representable by a microcanonical ensemble, it would restrict matters 
too much to assume the same for the components, since we should like 
to think of the components as having the possibility of changing their 
energy by transfer from one to another without altering the energy of 
the whole. For this reason we shall not be able to construct micro- 
canonical ensembles for the component systems, and the phase points 
for Sk may occupy any part of /«*,. But the phase probability P for 
the resultant system will presumably somehow depend on the instan- 
taneous energy of the system and we shall in fact assume that P is a 
function of E alone. For each component system there will also be 
a phase probability Pj = P,(Ej) where E, is the (variable) energy of 
the jth system and XE, — Eq. If the ensembles representing S and S, 
are stationary, P and P , will be independent of the time. The problem 
now is to find the functions P,(.E } ). The probability that the phase 
points for system Si lie in d<t> i of space m, • • • those for S, in dty of 
space tt } , etc. will be by definition 


P\d<t>\ • P2d<f>2 • • P kd<t>k • 

But this must equal the probability that the ensemble for S shall lie In 
d<t>, since the systems have been assumed to be independent. Hence 

P 1 P 2 • • Pkd<t>i • d<l>k = Pd<f>. (75) 

Since d<t> = dfa-dfa • • • d<t>k, we get the functional equation 

Pi (Ei) P*(E fc ) = P{Ei + E 2 + • ■ • + E k ). (76) 

We can write (76) in the form 

P.(E.) = log P, 

»Ti 


(77) 
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and if we alter E, by dE t subject to the condition that = dE$ = 0, 
the result is 


IV 

p, = d log p, 




or 




(78) 


dP 


By equating corresponding terms and recalling that — - is the same 

oE t 

for all i since P is a function of the total energy only, we arrive at 


1 dPi 1 dP 2 

Pi dEi Pa dE% 


1 dPk 

y t dE„~ “ I ’ s,ant “ C - 


(79) 


where C is a negative constant and will be written in the form —1/0. 
Therefore for any j from 1 to k the dependence of P, on E, takes the 
form 


P, = 


(80) 


The C } are multiplicative parameters to be evaluated by further con- 
ditions placed on the P } 


9 . THE CANONICAL ENSEMBLE 

The immediately preceding considerations at once suggest a new 
type of ensemble in which the energy is not restricted to lie in a shell 
but is allowed to vary con tinuously, and in which the fractional number 
of elements of the ensemble per unit volume of phase space is not 
constant but varies with the energy. Thus following Gibbs we shall 
introduce an ensemble such that 

Pdp = &-*&’’ V ' VI pj)/ *d4>, (81) 

with the probability coefficient 

p _ ( 82 ) 

We have here placed C = e* /B by the introduction of the new param- 
eter yf/. The meaning of (81) and (82) is this: For a given system 
the Hamiltonian function is a certain function of the q's and p's. 
The probability of picking at random an element of the ensemble lying 
in the phase element dp about the point (ffi • • • qj, pi • • • pj) depends 
on the position of this point through the Hamiltonian. Consider as an 
example the simple harmonic oscillator. Here 

Pdpdq = e* /e e~ Wim+ma)/B -dpdq. 



Sec. 9] 


THE CANONICAL ENSEMBLE 


127 


For small absolute values of p and q, i.e., small total energy of the 
oscillator, P will be larger than for large values of p and q, which 
correspond to larger total energy. 

In following Gibbs we shall call an ensemble for which the prob- 
ability coefficient has the form (82) a canonical ensemble. Its relation 
to the canonical distribution discussed in Chapter IV will be discussed 
in the sequel. 0 is defined as the modulus of the ensemble. The second 
parameter p can be expressed in terms of 0 by the relation (from 53) 

e~* /e = f e~ H/ °d<t>, (83) 

the integral being taken over all the portion of phase space occupied 
by the ensemble. There would appear to be a certain difficulty about 
this equation. Inspection shows that it is not dimensionally correct, 
since on the left we have a pure number, while dp on the right 

has the dimensions of phase space. We can correct this situation 
merely by thinking of dp as a pure number giving the ratio of the 
genuine volume element in phase space to an arbitrarily chosen unit 
volume. This will still enable us to compute ensemble-averages by 
means of eq. (54). 

In the canonical ensemble we have a means of representing statisti- 
cally a physical system of variable energy, i e., an unclosed system like 
the component systems S, mentioned in Sec. 8. We can then look 
upon the aggregate of such systems S Jt each represented by a canoni- 
cal ensemble, as a system of constant energy E 0 represented by a 
microcanonical ensemble. From thispointof view a canonical ensemble 
may be interpreted as a component part of a microcanonical ensemble. 
There is, however, another possible point of view which considers a 
microcanonical ensemble as a part of a canonical ensemble. Suppose 
all the component systems S } are dynamically similar with the same 
number of degrees of freedom, etc., e.g , similar molecules. Then they 
may all be represented in a single phase space which we may call the 
u space. We may now divide the p space into a set of energy shells 
within each of which the phase density is constant but with P varying 
from shell to shell according to (82). Here the microcanonical ensem- 
ble for each system S, appears naturally as a component part of a 
canonical ensemble for the same system. To a certain extent this 
justifies the name associated with it. 

Let us now find the probability that the element of a canonical 
ensemble shall correspond to energy lying in the interval from E 0 to 
Eq + AR By the fundamental postulate of Sec. 6, this is equal to the 
probability that the dynamical system represented by the canonical 
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ensemble shall be found in such a phase interval that its energy lies 
between Eo and E 0 + AE. But this probability is simply 

(84) 

» 

where A4> is the elementary volume in phase space corresponding to the 
energy shell in question. 

Rather more important is the calculation of the average value of any 
dynamical quantity over a canonical ensemble. Thus if x(ffi • • • qj, 
pi - • pf) is such a quantity, we have from (S3), (54), and (82) 


(x) 0 = = J'xe-^d* jj e~ B/9 d<t>. 


(85) 


Since (85) eliminates the parameter \p it is often the more convenient 
form. The integrations in both numerator and denominator are taken 
over the whole of the phase space occupied by the ensemble. 

As an illustration consider the average energy of an oscillator. 
It will be simplest to express the integrations in terms of the energy 
itself by using the ener gy shell as the elementary phase volume and 
writing d<f> — 2ir\S m /k dE. Then 


(£)« 



Ee~ B/9 dE 


e~ s ' 0 dE 


( 86 ) 


For the sake of simplicity we use the limits 0 and » , though strictly 
speaking the upper limit should be finite. Because of the exponential, 
however, the error due to this change will be slight. The result of the 
integration turns out to be 

(E) e = 0. (87) 

Incidentally the reader may show that the same result is obtained 
from the direct expression, using the Hamiltonian of the oscillator, 


(E)c = 


+ 00 

If 


(p 2 /2m + kq 2 /2)e- W2m+k * /2)/ *dpdq 


+» 

ff e~ W2m+ks ' /2)/e dpdq 


In attempting to generalize the preceding results we proceed to find 
the canonical average of 4>*fu, where 4>* is the total phase volume 
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enclosed by the energy surface H = E 0 and u(E) = d<t>*/dE (cf. eq. 60). 
Thus 


f (**/ 

= * 


J.‘ 


-H/8 


( 88 ) 


d<(> 


where 0 is the total phase volume over which we are taking the av- 
erage. If we now replace d<t> by a(E 0 )dE in the numerator, the ex- 
pression (88) becomes 


(<t>*/w) e = 


/ 


d>*e~ B/B dE 




-s/e 


(89) 


'd<t> 


A partial integration of the integral in the numerator yields 


(<t>*/u) c = 


—<f>*@e~ s/B 

EmiLT 

. +e J 

f e~ E/B d<t>* 
* 

t 

f e- E/B d<t> 


(90) 


If now we take E mKX large enough the integrated part will vanish since 
the term e~ E/B will wipe out 0*0, as long as 0* depends algebraically 
on E, which will be true in practice. Moreover in the integral in the 
denominator we may replace d0 by dcj>* merely by a transformation of 
coordinates, which results, for integration over a sufficiently large 
volume of phase space, in 

(**/«). - e. (9i) 

This provides an interpretation of the modulus of the canonical 
ensemble. Now in eq. (71) we have already shown that is itself 
the average of 2 T, (or twice the kinetic energy associated with the jth 
degree of freedom of the system) taken over a microcanomcal ensemble 
with elements filling the energy shell -Eo, Eq + A E enclosing the phase 
volume 0*. What we have shown in (91) then reduces to this: the 
canonical average of the microcanonical average kinetic energy corre- 
sponding to each degree of freedom of the system is equal to 0/2, i.e., 

WihJc - f - t ;. ( 92 ) 


We use the single bar to denote the generalized average of Tj. Equa- 
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tion (92) signifies equipartition of kinetic energy among the degrees of 
freedom of the dynamical system. The generalized average of the 
total kinetic energy becomes / 8/2, if there are / degrees of freedom. 
Our present procedure amounts to thinking of the canonical ensemble 
as consisting of a set of microcanonical ensembles. The generalized 
average then naturally is the canonical average of the microcanonical 
average. 

It is important at this state to compare the meaning of the canonical 
average of a physical quantity, like the energy, over the ensemble 
representing the system, with the averages encountered in the simple 
statistical considerations of Chapters II and IV. In the so-called 
canonical distribution of Chapter IV, we found expressions for the 
average energy, e g , eq. (30) of that chapter. This represents the 
average energy per particle of a dynamical system of particles when 
canonically distributed. For example, in a system of free particles, as 
in Sec. 5 of Chapter IV we found the average energy to be E = 30/2 
(eq. 86 of Chapter IV), where 0 is the modulus of the canonical distri- 
bution. Now it must be emphasized that the average energy over a 
canonical ensemble means something quite different from this. The 
average of a quantity over an ensemble refers, as has already been 
stated in Sec. 6 of this chapter, to the expected measured value of the 
quantity for the entire dynamical system in a state of equilibrium. 
Thus, in so far as a canonical ensemble is a suitable statistical repre- 
sentation of a system in equilibrium, the canonical average of the 
energy is the expected value of the energy of the system when we 
measure it in a state of equilibrium; it is not the average energy per 
component particle of the system. It is essential that this be realized 
in order that a correct understanding be had of the relation between 
the Gibbs ensemble method and the Maxwell-Boltzmann method of 
statistical distribution. In the latter method we deal with systems in 
which the total energy is constant and given and we are interested in 
finding out how the particles of the system are distributed with respect 
to their energy. The canonical distribution is that corresponding to 
maximum probability of realization subject to the constancy of the 
total number of particles and the total energy. We hoped to find and 
did indeed find relations among the various quantities defining the 
canonical distribution which correspond to known thermodynamical 
laws. In the Gibbs statistical mechanical method the energy of the 
system is no longer looked upon as an absolutely constant quantity. 
It may indeed fluctuate, but only about an average which it is the 
task of the theory to calculate. This average should be the experi- 
mentally measured value. It is also the further task of the theory to 
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develop relations among quantities defined by the ensemble which will 
correspond to the laws of thermodynamics. The virial theorem and 
the equipartition of energy are cases in point. Further illustrations 
will be developed in what follows. 

In his development of statistical mechanics Gibbs confined himself 
largely to the use of the canonical ensemble, and we shall follow this 
practice in the remainder of this chapter. As the reader has already 
observed from Sec. 8 the canonical ensemble arises naturally as a 
suitable representation of a component system whose energy may 
change by exchange with another component of the same macroscopic 
system, it being understood that the energy of the latter remains 
constant. The canonical ensemble is therefore well fitted to represent 
a system whose energy varies. However, it will later become clear 
(cf. Sec. 11) that it is also able to provide a very satisfactory statistical 
representation of a system whose energy is constant. The canonical 
ensemble is subject to easier analytical manipulation than the micro- 
canonical ensemble. If it can accomplish the same descriptive ends as 
the microcanonical ensemble its use will lead to considerable economy 
of thought. We may make one further remark at this point. We shall 
later associate the physical concept of temperature with the modulus 
0 of the canonical ensemble. This type of ensemble therefore repre- 
sents a system at constant temperature. 

10. THE MAXWELL-BOLTZMANN DISTRIBUTION LAW 

One of the searching tests of any statistical theory of dynamical 
systems is its ability to lead to the proper law for the distribution of 
velocities. We have now to examine this problem from the standpoint 
of Gibbs’ statistical mechanics. We must first construct a canonical 
ensemble for a system of n free particles all of mass m having /( = 3n) 
degrees of freedom. The Hamiltonian for such a system is 

H + Pvi + ( 9 ^) 

i - 1 

The phase probability in the corresponding canonical ensemble is now 
(writing p] = p% + p\, + p\„ for short) 

n n 

ZpJ/ 2 m 8 n dxj dy, dzj I \dp*i dpyj dpt j 
Pd4 - ' zi — ; • ( 94 > 

/ ( 2 /) / e~ Zp J /ama n dxj dy, dzj n dpx, dpy, dpt, 

J J /- i /-i 
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where the integral in the denominator is 2/ fold. This, of course, 
gives the probability associated with an ensemble element of vol- 
ume dp in the neighborhood of the phase x u y u z u pxi, Pyt, pit, * • ■ 
y n , *n, Pxn, Pvn, pm . From the fundamental postulate of Sec. 6, 
this is the probability of finding the actual system of f degrees of 
freedom in the state for which the first particle has coordinates in the 
neighborhood of *i, yi, Z\ and momentum components in the neighbor- 
hood of pxi, p v u pti, and correspondingly for the other particles. 

Now we actually wish the expression for the probability that any 
one particle (for simplicity we shall take the one denoted by sub- 
script 1) shall have its phase as indicated while the other particles are 
distributed in any fashion whatever consistent with the given total en- 
ergy and finite volume. From the fundamental postulate and eq. (94) 
this will be 


Pidpi = 

*» » n 

{If -6) /•-£*“ n dx,dy,dz, • n dpxj dp yJ dp t j 

J 2 j-2 

A 

(95) 

where A is the same denominator as in (94) and dp, = dxydyidzi- 
dpxidpyidpti. Note that the integral in the numerator is now (2 f — 6) 
fold. If further we wish the probability that the first particle shall 
have its momentum components in the neighborhood of p x i, pyt, pit 
while its coordinates are unrestricted we have further 


P i dpi 


-^"tdPxi dp v t dp tl f (/— 3) f JJ dp X} dp V} dp, s 

** ** j m2 

J U) J e-k- n * n dpx, dpy, dp z) 


(96) 


The integrals in (96) are easy to carry out by iteration of the well- 
known integral I JJf e~ (p>x+p * +pi,)/2m ® dp z dp v dp, = {2irm®) fi . The 


result is that 


P[dp i = {2rm®)-\-< p « +p » +p » )/2m *dPxidPy 1 dp, 1 . (97) 


In place of the first particle we might have taken any particle. We 
therefore reach the conclusion that the average fractional number of 
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particles having their momentum components in the interval p x ,p x + 
dp z , • • • becomes 

^ = (2* m®)-\-^ +1, l + ^ 2mB dp x dp v dp t . (98) 

If further we replace p x by mv x , etc., this would take the precise form of 
the Maxwellian velocity distribution (50) in Chapter V, provided in- 
deed we are entitled to replace the characteristic parameter 0 of the 
ensemble by kT, The temptation is therefore strong to look upon 
(97) and its equivalent (98) as the statistical mechanical version of the 
Maxwell-Boltzmann distribution law and to consider the parameter 0 
as the statistical mechanical analogue of something proportional to the 
thermodynamic temperature. 

It is worth while pointing out that the restriction to absolutely free 
particles is not necessary. We can introduce a potential energy func- 
tion V(x )t y„ Zj) into the Hamiltonian. As long as it is a function of 
the coordinates only, its influence will cancel out in the integration 
over the coordinates and the final result will still be (97). Of course we 
can then no longer maintain that any position coordinates for the 
system are as likely as any other : as a result of the forces on the system 
certain parts of configuration space will in general be preferred by the 
system, depending on the nature of V{x„ y„ z,). 

A further observation is in order on the statistical mechanical 
interpretation of the Maxwell-Boltzmann distribution law. In the 
immediately preceding discussion we have tried to develop the analogy 
with kinetic theory by constructing a Gibbs ensemble for a whole 
system of free particles. This seems to be the natural course to pursue, 
but it is interesting to observe that we can achieve the same result by 
forming the ensemble for a single particle. Let us see how this comes 
about. The Hamiltonian for a system consisting of such a particle of 
mass m, having kinetic energy only, is 

H = {fil + Pi + p\)/2m. (99) 

The phase probability for the corresponding canonical ensemble is 


e <4+pJ+^/ 2me dx dy dz dp x dp v dp, 
ffffff e ~ (p * +p * +p ‘ )/2me dx dy dz dp x dp v dp. 


It must be emphasized that in constructing the canonical ensemble for a 
single particle we cannot assume that the energy of the particle is 
fixed. By the very nature of the canonical ensemble we must assume 
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the possibility that the energy of the system being described shall be 
able to fluctuate. The particle we are here talking about is one whose 
energy can then have any value. It cannot therefore be considered 
free in the usual dynamical sense, even though we are treating its 
energy as wholly kinetic. 

If we carry out the integration in the denominator of (100), we can 
write 


Pd4> 


e (pl+pj + pj)/2me fa fip^ 

r(2irm®) h 


( 101 ) 


where r is the physical volume in which the particle is confined. If 
further we require the probability that the phase point have momentum 
values Px, Pvt Pr in the indicated interval with no restriction on the 
coordinates x, y, z, we obtain 

P'd<t>' = (2«n0r*e~ <«4+4+4>/*»e dp x dp y dp t . (102) 

This is identical with eq. (98) which gives the average fractional num- 
ber of particles in a whole aggregate of free particles having their 
momentum components in the indicated interval. It therefore appears 
that we need not have considered the whole aggregate in forming the 
canonical ensemble. A single member suffices. Closer inspection of 
the situation indicates that the reason for this is that the particles in 
the aggregate are free and do not affect each other. If the particles 
were to act on each other with forces, the Hamiltonian would no longer 
be of the simple form (93) and the possibility of replacing the whole 
Hamiltonian by that for a single particle in forming the ensemble would 
no longer exist. 


11. DEVIATIONS OF QUANTITIES FROM THEIR AVERAGE VALUES 

The utility of statistical mechanics for thermodynamics resides in 
the average quantities computed over an ensemble; these are to be 
identified with the physically measured values of these quantities. 
For example, the actually measured energy of a physical system is 
taken to be the canonical average of the energy over the canonical 
ensemble representing the system. On the other hand in the ensemble 
the various elements correspond to different energies and there are 
then actually wide deviations from the average. This situation appears 
rather disconcerting when we are trying to represent a system with con- 
stant energy by a canonical ensemble. Offhand we should be inclined 
to say that it could not be done and that we must represent such a 
system by a microcanonical ensemble. That this is not really always 
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the case, however, is seen from an examination of the magnitude of the 
deviations from the average energy in a canonical ensemble. 

We need not confine our investigation to the energy. Let x be any 
function of the coordinates and momenta of the system. The fluctua- 
tion or deviation of x from its canonical average is represented by 

* - X - X- (103) 

(The bar here indicates canonical average without further specifica- 
tion.) The average deviation is then at once 


4 = (x - x) = 0. 


(104) 


To get something more significant we introduce the 
mean-square deviation or 



fractional root- 


(105) 


We shall take a as a measure of the average deviation of x from its 
average value. For the sake of being specific let x = H and let us 
confine our comments to an ensemble representing an aggregate of free 
particles, i.e., an ideal gas, where 


II = E 



(106) 


As usual / is the number of degrees of freedom of the system. We 


replace 1/0 by z and let J' 

dz J 


e H, d<f> = Q Then 


IIe~ H ‘d<t> = - E Q 
from the definition of canonical average. Moreover 

H 2 e~ H, d<t> = E 2 -Q. 


dz 2 J 


We therefore have 

^ i’Q/*' IdQ/d-f „,AE 

E - E jj-.e-. 

Hence 

- E 2 = 0 IdE _ R 

E 2 _ “ \/ 



(107) 

(108) 


Consequently if the number of degrees of freedom of the system is 
very large, as it will be in the case of a gas, a becomes negligibly small 
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and we are after all justified in using a canonical ensemble to represent 
the system. 

There is another way of looking at the matter which brings out the 
same point. We may compute the fractional number of elements in a 
canonical ensemble having energies lying in the range from E to 
E -f- A E, where E is any energy value. This fractional number will 
clearly be 

AN rB+AB 

(109) 

N Je 

In this expression j s to be integrated over that portion of phase 

space for which the Hamiltonian H lies between E and E + AE. Now 
if we restrict the discussion to an ideal gas of n particles of mass m, 
H has the form (/ = number of degrees of freedom = 3») 



In evaluating (109) it will be convenient to write d<t> = d<t> q d<f> p where 
d<t> q is the configuration coordinate part of the phase space volume 
element and d<t> p is the momentum part. Then 


AN 

N 


_//3 


I. 


B+aB 


.(*- 


H)/B d<t> p 


( 111 ) 


where r = the physical volume occupied by the gas and 



Now we shall let the volume in momentum space occupied by the points 
representing systems whose energies lie between 0 and E be where 
then 


% = 



( 112 ) 


It is clear that % is some function of E, and we must now find out 
what this function is. Let us imagine first that E = 1 (unit of energy). 
The upper limit in (112) is then really equivalent to the relation 


2m 



= 1. 


(113) 


On the other hand if E 


a 2 , the upper limit in (112) is equivalent to 


2m 



= a 


2 


(114) 
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It is dear we can write (114) in the form 


2m 



1 . 


Consequently the integral 

f •/-(? )<(?)• 



( 114 ') 


in which the lower limits are zero in each case and the upper limits 
subject only to (114') is the same as the integral 


(%)*-i 



dpi dp 2 • • • dp/, 


in which the lower limits are zero and the upper ones subject only to 
(113). But the integral for which the energy E has the value o* is 
given by 




where the upper limits are subject to (114'). Therefore we conclude 
that 

i*p)s = E"\* p ) Bm i. (115) 

must then vary directly as 2S , since ($„)*_ j is a constant. This 
constant may be evaluated as follows. Let 

4-, = CE" 2 . (116) 


Going back to (111) we see that we can replace d<f> p by d$ v — 
f/2 CE ,/2 ~ 1 dE. Moreover if we integrate (111) between E = 0 and 
E — oo we get unity. That is 


fCr //3 

2 

f e ( *~ a)/ *E ,/2 ~ l dE = 1. 

i/ 0 

(117) 

We recall that 



A/e 

1 1 

( 118 ) 


’"’Urn#)"* 

Therefore (117) becomes 



fC 

_ /".-MbOT-Ijk _ . 

( 119 ) 

2(2* w0) //2 Jo 



138 


CLASSICAL STATISTICAL MECHANICS 


[Ch. VI 


From the definition of the gamma function there results 

(2 im) //2 = (2xw) //a 

= f/2-T(f/2) “ r(//2 + 1)' 


( 120 ) 


We may now express AN/N as follows 


AN 

N 


1 / (2irm) //a 

(27rm©) //2 ‘ 2 ’ T(//2 + 1) 


X 


•B+AB 


e -B/e E ,/2- i dEi 


which reduces for AE sufficiently small compared with E to the form 


AN _ e-*' e (Ey 2 - 1 

N ©r(// 2 ) ' W 


( 121 ) 


Incidentally the reader may check this final expression by observing 
that the summation of (121) over all E, gives unity in conformity with 
the fact that 

2A N 
N 


Let us now find the “most popular” value of E, i.e., that for which 
AN/AE is a maximum. For this value of E there are in a given energy 
interval more elements of the ensemble than in any other interval of 
the same size. We differentiate AN/AE with respect to E and on 
putting the result equal to zero obtain 

E m P = (|-l)@. (122) 

If/ is very large this is effectively//2-© or the canonical average (106). 
For a system with a very large number of degrees of freedom the 
canonical average energy is also the “most popular” value of the 
energy. This again checks our previous conclusion that for / large we 
do not make a very great mistake in using the canonical average E 
for the energy of the system represented by a canonical ensemble. 

There remains only the task of finding the number of systems hav- 
ing energy slightly different from E mp . Thus, for example, suppose 
E — 1.01 Emp = 1 01 (//2 — 1)0. It is left to the reader to carryout 
the straightforward arithmetical calculation showing that for 1 cm 3 
of gas under standard conditions with / = 3(2.7) X 10 19 

AN(E) _i8xio“ 

AN (Ejn P ) 

This shows that the chance of picking out in the canonical ensemble an 
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element corresponding to energy differing by only 1 per cent from the 
most popular value is practically negligible. The corresponding fluc- 
tuation of the energy of the actual system represented by the ensemble 
from E is therefore also negligibly small. All this of course is predi- 
cated on the large value of /. 

12. THE STATISTICAL MECHANICAL INTERPRETATION OF 
TEMPERATURE 

We have now seen the possibility of representing thermodynamical 
quantities by means of averages over Gibbsian ensembles and have 
learned that the choice of ensemble, i.e , whether canonical or micro- 
canonical, is largely a matter of mathematical convenience. In order to 
make the connection between statistical mechanics and thermodynam- 
ics more definite and convincing it is necessary to establish more 
precisely the statistical mechanical analogues of the fundamental 
thermodynamical state variables, temperature and entropy. As a 
matter of fact in Sec. 10 in the discussion of the Maxwell-Boltzmann 
distribution law we have had an intimation that the analogue of 
temperature is to be found in the characteristic ensemble parameter 0. 
We now wish to place this association on a somewhat firmer basis by 
showing that 0 possesses certain important properties which are pre- 
cisely those of the te mperature in the thermodynamical sense. We 
recall that when two thermodynamical systems are in equilibrium the 
condition that they have the same temperature is that when they are 
put in thermal contact (and of course isolated from all other systems) 
the composite system thus formed will be in equilibrium at the same 
temperature as that of either system before contact. On the other 
hand two systems in equilibrium at different temperatures will not be 
in equilibrium when joined, though they will, of course, approach 
equilibrium with a change in the temperature of both. 

Let the Hamiltonians of the two systems be H A and H B respec- 
tively. The phase probabilities in the canonical ensembles representing 
the two systems will then be respectively 

PA = e i +*- H * )/ *d<t> A , (123) 

P B d* B = (124) 

where we have assumed the same parameter © for both ensembles. 
Now let the systems be joined. The Hamiltonian for the composite 
system will have the form 

H = H a + Hb + Hab> 


(125) 
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where the term H AB refers to the energy of -interaction of the two sys- 
tems. We shall suppose that this energy is very small compared with 
either H A or H B , though there are indeed practical cases in which this 
is by no means true ; these we shall rule out of our present discussion 
and proceed to call H AB zero, effectively. 

Now P A d<t> A denotes the probability of picking at random from 
the ensemble corresponding to system A an element whose phase 
lies in d<t> A , and similarly for B. Let the phase space for A have 2k 
dimensions and that for B have 21 dimensions. Then the phase space 
for the composite system will have 2k + 21 dimensions and the element 
of volume in it will be denoted by d<f> A d<f > B . The probability of picking 
at random from the ensemble of the composite system an element whose 
A component has its phase in d<j> A and whose B component has at the 
same time its phase in d<f> B is, from elementary probability considera- 
tions, 

P A P B d* A d* B = *»a+*-**-**>/%^. (126) 

In other words we think of the combination of the two systems as tak- 
ing place in such a way that each element of the composite ensemble is 
made up by joining two elements of the original ensembles for A and B 
respectively. But the probability of picking out of the composite 
ensemble an element with phase in dtp = d<f> A d<f> B of course is just 

Pd* = e ( * _a)/e d«. (127) 

Now * — * A -f- \ff B since the \p's are constants and no matter what 
precise relation exists between P A P B d* A d* B and Pd*, we must have 
in both cases 

J' P aP B d4> A d4> B = 1 = J' Pd*. (128) 

But from the further fact that H — H A + H B in the ideal case of 
negligible interaction it follows that actually 

P A P B d<t> A d4> B = Pd*, (129) 

which means that when the two original systems having common © 
are joined the resulting system is described by an ensemble character- 
ized by the same @. On the other hand if the ensemble for A had 
been characterized by the parameter © x and that for B by a different 
parameter © B , after joining we should have 

P A P B d* A d* B - e^- a ^ /B ^e { '*B~ H B^ B Bd < t >A d<i> B . (130) 

This would not correspond with Pd* for any 0 and hence no canonical 
ensemble could be formed for the resulting system, i.e., it could not 
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be a system in equilibrium in the theory of statistical mechanics. 
However we must suppose that after a sufficiently long time has 
elapsed the composite system will come to equilibrium by some kind 
of energy exchange between the two original systems. It can then be 
shown-that the modulus 0 of the resulting ensemble has a value lying 
between © 4 and & B . Indeed by the use of the equipartition principle 
it develops that 0 is related to © x and ® B in precisely the same way in 
which the final equilibrium temperature of a mixture of two gases is 
related to the original temperatures of the components. 

It is clear that 0 possesses the chief properties that characterize 
the thermodynamical temperature T. The precise relation between 
0 and T, i.e. f the nature of the constant c in the relation 

0 = cT, 

can be obtained only from a comparison between an empirical thermo- 
dynamical law and the statistical mechanical analogue. If we use the 
equation of state (eq. 14 of Chapter III) for this purpose it is not 
difficult to show that c = k, i.e., that 

0 = kT, (131) 

which indicates that the ensemble modulus 0 in the Gibbs statistical 
mechanics plays the same role as the canonical distribution modulus 
in the classical Maxwell-Boltzmann statistics (cf. Section 3 of Chapter 
IV). 

13. THE STATISTICAL MECHANICAL INTERPRETATION OF ENTROPY 

Closely associated with and fully as fundamental as the interpreta- 
tion of temperature in statistical mechanics is the meaning of entropy. 
The problem is to find a statistical quantity which possesses the 
properties associated with entropy in thermodynamics (Chapter III, 
Sec. 2). Following Gibbs we again begin with a canonical ensemble in 
which 

e -*/e = J e ~ H/9 d4,. (83) 

Now the Hamiltonian of the system is a function of the p’s and q's 
which also involves certain parameters. These we shall label ti, fs ■ • • 

• ■ • . Their number depends on the type of system and the number 
of degrees of freedom. In the simple harmonic oscillator, for example, 
there are two, viz., the mass and stiffness of the system. (Cf. Sec. 3 
of Chapter IV for the introduction of these external parameters in the 
Maxwell-Boltzmann statistics ) Let us now see what happens when 
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the |’s and 0 are varied. This variation has no reference to change in 
time and does not entail changes in the p's and q’ s. Equation (83) then 
yields 

- 1 ■/*-"** - 1 

Multiplication by &e* /B and the replacement of J7e ( * -H)/e dp by 
H yields _ 

** = (4/ ® H) s& *&• (l33) 

If we denote (p — H)/Q by the new symbol ij, so that — H)/Q — v, 
we have from (133) 


Sp — 


+ e(^D *■ 


(134) 


Now if in the defining equation for ij we vary both coordinates and 
parameters, denoting such changes by differentials to distinguish 
them from changes in which parameter alterations alone are involved, 
the result is 

dp — dH = vdQ + &drj. (135) 

But since p and 0 are independent of the coordinates 
dp = 6 p and d© = S0. 


Comparison of (134) and (135) then leads to 

dM=-edv < 136 > 

where we have replaced dH by its equivalent dE, i.e., the change in the 
average energy. Now the last term in (136) represents the average 
change in the energy of the system due to the variation in the external 
parameters. Consequently it can be interpreted as the negative of the 
average work which the system does When the external parameters are 
varied. We shall denote it by —5 W. Therefore (136) becomes 


-&<&, - dE + SW. 


(137) 
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If we introduce the relation (131) between © and T, this takes the 
form 

-kT<Hj = dE+ 5W, (138) 

which looks very much like the familiar thermodynamical expression 
(eq. 4 in Chapter III) 

TdS = dE+ SW, (139) 

(where indeed we previously used A Win place of SW with no difference 
in physical meaning). If we decide to let (138) serve as the statistical 
interpretation of (139) we must identify dS with — kdrj or set 


S = - ki). 


(140) 


with a possible additive constant independent of the coordinates and 
external parameters as well as © and p. This is Gibbs’ statistical 
mechanical interpretation of the entropy of a system. 

We ought to look into the analogy between — rj and the entropy a 
little. For example we recall from Chapter III the statement of the 
second law of thermodynamics in the form that the entropy is a maxi- 
mum for a closed system in equilibrium. Can we show that — rj is 
larger for a canonical ensemble than for any other ensemble with the 
same number of elements and the same average energy? We shall 
call ij — (P — H)/% = log P, the index of phase probability. Now let v 
for the non-canonical ensemble be denoted by ij\ where 


n' = V + &V> 


(141) 


and Aij is an arbitrary function of the p's and g's. Because the number 
of elements and average energy are the same in both ensembles, we can 


write 



f e' + *'dp = 

j e' l d<f> = 1, 

(142) 

and 



j He' +A 'dp = 

j He'd p. 

(143) 

We seek to prove that ij’ > rj. 

This corresponds to 

. 1 
-3 1 

, V 

1 


Though the average r[ is not a canonical average it is calculated over 
its ensemble in the usual way. Thus we wish to show that 


J (v + Av)e' + *'d<l> > / i je'dp. 


(144) 
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Now 

f (7 + A n)e'+*'d4 = f + A,) 

= | f e’ +A M<*> - | y He ,+A ’d* + ^ A,e’ +A ’ d<t>. (14S) 

By using (142) and (143), we can reduce (145) to 

fir i + A,)e’ +A ’d* = | - 1 / He'd* + y Arje ,+A, d0. (146) 


But for the same reason 

y ve n d<t> = j| - ^ f He" d$. 


(147) 


Therefore to prove (144) we must examine y Aij + d<t>, which can 
also be written f (Aije*'’ + 1 — e A, )eV<>. The parenthetical term in 


the integrand is (Ai? — 1)^’ + 1. If we plot this as a function of A 7 , 
we see that it has only one minimum for real values of A 7 . This has 
the value zero and occurs for A 7 == 0. For all other values of A 7 , the 
term in question is positive and greater than zero. Moreover c" is 
always positive. Therefore 



A ’ + 1 - e A V d<t>^ 0 


(the equality sign corresponding to A 7 = 0) for the whole range of 
integration. Therefore either A 7 vanishes, in which case the two 
ensembles are identical throughout, or 

y A7 > 0 


and (144) is substantiated. Hence 

-V>~V'- (148) 


Consequently for a system not in equilibrium and represented by a 
non-canonical ensemble the entropy represented by — 7 ' will tend to 
increase. It is clear that the statistical mechanical analogy —£7 for 
entropy satisfies the “increasing” property of entropy in thermody- 
namics. 
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Further evidence of the suitability of — ij to represent the entropy 
of a thermodynamical system will be found in the fact that —ij is a 
maximum for a uniform distribution of the elements of the ensemble in 
phase space as compared with any other distribution of the same 
number of elements and corresponding to the same limits of the 
phase. The reader can readily show this by a slight modification of 
the method just used to demonstrate — q > — 7- In other words 
—if can be used as a measure of uniformity; the greater the value of 
—If for a given ensemble, the more nearly uniform is the distribution 
of elements. The canonical ensemble is the one which is most nearly 
uniform in distribution of all ensembles with given average energy 
and given number of elements. Obviously the appropriateness with 
which a canonical ensemble can be considered to represent a system in 
equilibrium depends in the last analysis on the extent to which any 
ensemble representing an actual dynamical system tends toward uni- 
formity of distribution with the passage of time, independently of the 
initial distribution. This approach to uniformity cannot be logically 
demonstrated for an arbitrary initial distribution but proofs have been 
given for initial distributions which are not themselves so specialized 
as to be highly improbable. 11 We therefore feel safe in accepting the 
proposition that the Gibbsian analogue of the entropy for a closed 
system tends to increase and practically never decreases. We must 
recognize, of course, that it is subject to the same probability difficulties 
already envisaged in Sec. 3, Chapter IV in the discussion of the Max- 
well-Boltzmann interpretation of entropy. 

One last point remains for consideration: does — krj represent a 
state variable? From the definition 

it follows that after the integration has been carried out the only quan- 
tities left on the right-hand side are the parameters, viz , the £„ @ and t]/ 
and the physical volume occupied by the system being represented. 
The latter is involved in the integration limits for the q’s. The limits 
for the p’s are -j-oo and -oo effectively. 0 = kT. Hence effectively 
— kl\ depends only on the volume and temperature and the other 
parameters which characterize the state. Therefore —krj represents a 
state variable and its association with entropy may be considered sub- 
stantially verified. 

11 Cf. Gibbs, "Statistical Mechanics,” Chapter XII For a concrete illustra- 
tion, Lindsay and Margenau, op. at , p. 245, may be consulted. 



146 


CLASSICAL STATISTICAL MECHANICS 


[Ch. VI 


14. FREE ENERGY AND THE GIBBS PHASE INTEGRAL 


We have so far said little about the parameter p which enters into 
the definition of the probability coefficient for a canonical ensemble. 
But eq. (134) written in the form 


dp = — SW, 


with ij = — S/k and d@ = kdT, is in precisely the form of eq. (10) of 
Chapter III, in which d p (there represented as dV) represents the 
change in the free energy of the system. Consequently we may safely 
treat p as the statistical mechanical analogue of the free energy. We 
have already commented on the importance of this thermodynamic 
function in the derivation of the equation of state (eq. 12 of Chapter 
III). It is interesting to observe that in the Gibbs statistical mech- 
anics p is immediately given in terms of a certain integral. Thus from 
(83) 

* = - © lo gj e~ B/9 dp. (149) 


It is customary to refer to 



-H/e 


d<t> as the Gibbs phase integral, 


sometimes denoted by I. Its evaluation as a function of 0 and the 
external parameters of a system therefore leads at once to the equation 
of state. If we compare (149) with eq. (72) of Chapter IV we see that 
the Gibbs phase integral appears to bear some analogy to the function 
Z, which there we called the distribution or partition function. Indeed 
the connection looks even closer if we examine again the evaluation of 
Z for a system of free particles in Sec. 4 of Chapter IV. There except 
for a multiplicative constant we actually computed the phase integral. 
(Note : It must not be forgotten that p in statistical mechanics corre- 
sponds to 4' in Chapter IV.) 

As an illustration of the phase integral we shall calculate it for the 
special case of a system of n simple harmonic oscillators with masses ntj 
and stiffness coefficients k y The Hamiltonian is 


H = 




(150) 


if we suppose that the oscillators are free of mutual interaction. We 
therefore have from (149) 

P = - © log^ (2n) J e-* p * /2m > 9+k rf /2S) h' n d qi • • • dq n d/> x • • • dp n , 

( 151 ) 
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with the limits of integration taken between - » and + oo for each var- 
iable. In this expression we have written dd> = — 

A" 

where A is a quantity having the dimensions of coordinate times 
momentum and is here put in to secure the proper dimensionality (cf. 
the discussion after eq. 83 in Sec. 9). Now 


Consequently 



— pj/2m,e 


dp } = V 


+ = 


0 log ©" 



(152) 


If the dependence of the frequencies v, on the volume of the system 
were known we could use (152) to determine the equation of state. 
However, we can at any rate get the expression for the entropy. From 
the equipartition of energy we know that 

£ = «0 = nkT. (153) 

This could of course be computed directly from II. Now from ^ = 
E — TS, we get 

-* + *■<* 

-»*+* '■*]!(£} (i54 > 


So far as the Gibbs statistical mechanics is concerned A is just a con- 
stant having the appropriate dimensions to make the fundamental 
expression (151) dimensionally correct. It is clear, however, that 
we can interpret A" as the unit of “volume” in phase space. As such 
in classical theory it may have any numerical value. According to the 
quantum theory, however, A is a fundamental constant of nature, 
called the Planck constant of action, with the value (cf. Chapter 
vin ) 

A = 6.55 X 10 -27 erg sec. 


The dimensions of hv } are then those of energy like kT. 

In the classical theory of solids, a solid crystal is considered to be 
effectively a collection of harmonic oscillators. Consequently the 
formulas just derived have an application to an ideal crystalline solid 
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In particular from (153) we can get the heat capacity at constant vol- 
ume for such a solid. Thus _ 

Cy = = nk. (155) 

Here n is the number of degrees of freedom of the crystal. If we think 
of each atom making up the crystalline solid as having three degrees of 
freedom and assume that the number of atoms per gram molecule is 
still Avogadro’s number, it follows from (155) that the molar heat 
capacity of a monatomic crystal is 3(6.06) X 10 23 X (1-37) X 10“ 16 
ergs/degree C. When reduced to calories degree C this figure becomes 
5.96 cal/degree C. This is in rather good agreement with the experi- 
mental value for monatomic crystals at room temperature. The simple 
classical theory here presented fails indeed to account for the variation 
of the molar heat capacity with the temperature. For this the quan- 
tum theory seems to be demanded. (Cf. Chapter IX, Sec. 3.) 

It should be possible to use eq. (149) for the free energy to attack 
theoretically the derivation of the equation of state of a real gas. Con- 
siderable progress has been made on this recently but the subject lies 
beyond the scope of the present book. 1 * 

15. ALTERNATIVE INTERPRETATION OF ENTROPY 

It is well to recognize that — ?j is not the only statistical mechanical 
quantity which possesses the appropriate properties to serve as an 
analogue of entropy in thermodynamics. Consider the quantity 

S = k log bn, (156) 

whereof is the total phase volume enclosed by the energy surface 
H — E, and E is the average energy over a canonical ensemble. Let 
the energy of the dynamical system represented by the ensemble, 
namely E, be changed slightly without altering the external parameters 
(i • ■ • Physically this will correspond to a flow of heat into or out 
of the system. Then from (156) there will be a change in 5 of magni- 
tude 

dS = kd\og<f,s = ~^dE, (157) 

where d<j>g/dE is the rate of change of <f>g with respect to E while the 

11 An elaborate discussion will be found in Mayer and Mayer, "Statistical 
Mechanics,’’ John Wiley & Sons, New York, 1940, pp. 277 ff. 
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£'s remain fixed. From eq. (60) d<j>§/dE - 01 (E). Hence 


dS = ~^. 

4>e/u(E) 


(158) 


Now we have already shown (eq. 71) that 0j/w(£) is twice the average 
kinetic energy per degree of freedom taken over the microcanonical 
ensemble corresponding to total energy E. If the system consists of 
free particles, as we shall here assume for convenience, this means that 


4>b _ 2 E 

«(£) “ 7 * 


(159) 


if f is the number of degrees of freedom. But for an aggregate of free 
particles, the equipartition theorem yields 


Therefore (158) becomes 



dE 


(160) 

(161) 


But since dE must here represent change in energy caused by the 
reversible flow of heat, this is the usual expression for the change 
in entropy in a reversible thermodynamical process. 

Although the above discussion is not very rigorous and is indeed 
rather specialized the suggestion is, at any rate, that k log <f>g is a 
possible analogue of the entropy. We could proceed to apply to it the 
tests used on — rj. It will be simpler, however, to examine directly its 
relation to — rj. We shall restrict the discussion to an aggregate of 
free particles, i e., an ideal gas. 

We begin with 

e -i = e -*/e//e (162) 


But for an ideal gas this becomes 

g-i = g-*/v /2 . 

We can dispose of the factor e~* ,B by recalling that 

e~* /B = J e~ a/e d<l>, (163) 

where the integration is to be conducted over the whole phase space. 
Writing 

d<t> — d<pqdt$>p t 
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where dp 9 refers to that part of the elementary phase volume in which 
the q's enter and d4> p the corresponding momentum part, we can further 
say that 


,-*/e 


- +« +» 

= r" 8 / ... j e 

i/.QO %S — 00 


dp/, (164) 


where r is the physical volume occupied by the gas. The /-tuple 
integral in (164) is evaluated in the usual way. The result is 


e~*» = r' /s (2 mef. 

The consequence is that 

— * = log T //3 + ^ log 2t w© + |- 

We now wish to compare this with log <f>g. This may be written 

-s 


<t>& — r 778 f d<t> P . 
Jq 


(165) 

(166) 

(167) 


The integral has already been evaluated in eqs. (116) and (120), and 
we can immediately write 

t" 3 (2 *m)" 2 E f/ * 


Pi 


(168) 


This leads to 


r(//2 + 1) 

log <t>s = log r 778 + ~ log 2 vm& + 2 log 2 ~ log r (2 1 )’ 

for large positive / 


Now the asymptotic expansion of log T (i + ‘) 
can be put into the form 1 ' 

108 r (2 + *) - (I + 0 io * { - (I + 0 + >°« ^ 

+ terms involving ^ in the denominator. 

A 


Neglecting terms in this expansion small compared with //2 we see 
that log Pa can be expressed in the form 

log <t>g = log / /3 + ^ log 2 h 7M© — log y/vf -f (169) 

A A 

If now we compare —rj in (166) with the asymptotic form of log 4>s 
u Whittaker and Watson, "Modern Analysis" fourth edition, Macmillan, 1928. 
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in (169) we see that they differ only in the term log -y/rf which is very 
small compared with the other terms in / if / is large. Hence as / 
increases we have the asymptotic relation 

-tj~log4>j. (170) 

This indicates the important relation between the two definitions of 
entropy. 

16 . r£sum£ 

This concludes our survey of the statistical mechanics of Gibbs 
which will stand for a long time as a monument of the power of abstract 
thought over physical problems. At the risk of a certain amount of 
redundance it will be desirable to sum up its features as contrasted to 
those of the Maxwell-Boltzmann method of statistical distributions. 

We recall in the first place that the statistical distribution method 
operates throughout with the actual system being discussed. Thus, 
for example, we have a certain set of independent particles with a 
certain property, e.g , energy, and inquire about the most probable 
distribution of the particles with respect to this property. This most 
probable distribution is, of course, arbitrarily defined, but the defini- 
tion in terms of the number of ways of realizing each state of the sys- 
tem, is at any rate plausible. This process leads to the so-called 
canonical distribution. We seek to identify the parameters entering 
into the distribution, i e., ©, and log P e or log w, with observed 

properties of the system being described. This is done by showing 
that relations satisfied by these quantities are mathematically of the 
same form as the important thermodynamical relations among the 
state variables of the system in the thermodynamical mode of descrip- 
tion. This provides one statistical interpretation of the macroscopic, 
thermodynamical properties of physical systems. The scheme has been 
criticized from several points of view, notably because of the use of 
Stirling’s formula in its mathematical development to evaluate N ! no 
matter what the value of N is. This has led to an alternative formula- 
tion, namely the method of Darwin and Fowler, which we shall discuss 
in the following chapter. 

The method of Gibbs does not operate with the actual system being 
discussed. Rather it builds an abstract ensemble to represent the 
behavior of the system. The subsequent analysis is carried out wholly 
with the ensemble and the connection with the properties of the actual 
system is made solely through the fundamental postulate of Sec. 6. 
Thus averages taken over the ensemble are conceived to represent 
observed values of the corresponding quantities for the actual system. 
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This places extreme importance on the choice of ensemble. Gibbs’s 
choice of the canonical ensemble (he made little use of the micro- 
canonical ensemble) must be considered a stroke of genius. Operating 
with it, he was able to provide an analogy for all the thermodynamics 
which was known at his time. It is interesting to observe that he made 
no effort to build a model mechanism for the physical system under 
consideration. The canonical ensemble is in no sense a model. It is 
an abstract fiction, having no physical existence. Of course, it is true 
that to calculate averages over a canonical ensemble, one needs the 
Hamiltonian function of the system being described; this marks 
Gibbs’s scheme as lying wholly within the framework of classical 
mechanics. The Maxwell-Boltzmann method is not subject to this 
particular restriction as it can envisage the distribution of any set of 
entities whatever. In one sense, therefore, the Maxwell-Boltzmann 
method is more general than that of Gibbs. On the other hand the 
Maxwell-Boltzmann statistics is more specialized in the sense that it 
operates only with free particles and thus neglects the possibility of 
their mutual interactions. The attempt to apply the Maxwell- 
Boltzmann statistics to a real gas, for example, necessitates the 
introduction of mechanical concepts and the postulation of forces 
lying outside the framework of the method itself. Indeed it involves 
essentially the application of some kind of kinetic theory. The method 
of Gibbs, on the contrary, is general enough to include all sorts of 
dynamical systems within its scope. 

The comparison between the two types of statistical method will 
hardly be complete, however, without an exposition of the Darwin 
and Fowler modification of the Maxwell-Boltzmann scheme. This 
will form the content of the following chapter. 


PROBLEMS 

1. Apply Euler's theorem on homogeneous functions to deduce eq. (22). 

2. Write the expression for the kinetic energy in terms of spherical, cylindrical 
and paraboloidal coordinates. Evaluate the component conjugate momenta in 
each case and comment on their physical significance. 

3. Prove that the canonical equations are invariant in form with respect to an 
arbitrary point transformation of generalized coordinates 

4. A particle revolves in a circle about a fixed axis. Plot the representative curve 
in phase space. 

5. A particle moves along a straight line in a uniform field of force. Plot the 
representative curve in phase Bpace under the assumption that the particle is not 
allowed to exceed a certain maximum velocity. 

6. A particle moves along a straight line in a field of force directed toward a 
fixed point on the line and varying inversely as the square of the distance from the 
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point. Plot the representative curve in phase space under the assumption that the 
particle is confined to a segment of length a on either side of the fixed point. Assume 
that the total energy is negative. 

7. In the case of the simple harmonic oscillator form the integral £ pdq, which 

is supposed to be taken over a whole phase curve. This is called the phase integral. 
What allowed energies of the oscillator correspond to equating the phase integral 
to nh, where h is Planck’s constant of action and w is integral? Do the same problem 
for the simple rotator mentioned in Problem 4 above. 

8. Find the expression for the average energy over the canonical ensemble cor- 
responding to a simple rotator. Do the same for a microcanonical ensemble. 

9. Discuss the phase space for inverse square central field motion in terms of 
spherical coordinates. N B This is, of course, fout-dimensional, but since in 
central field motion fy is constant, one can adequately represent the situation by a 
three-dimensional phase space, by employing r, 9, and pr as coordinates. Plot the 
surface of constant energy in this space. Indicate the phase curve on this surface. 

10. Use Gibbs’s method to show how Liouville's theorem maybe used to prove the 
invariance of the element of volume in phase space with respect to an arbitrary point 
transformation. 

11. A particle moves in an inverse square central force field in an elliptical orbit 
with the force center at one focus. Find the expression for the probability that the 
particle will be found with its radius vector lying in the interval r,r + dr. 

12. Prove that the canonical average of the energy of the simple harmonic oscil- 
lator is equal to the modulus © by evaluating 


+ fcg*/2)e~ + *t*/2>/8 dp dq 

y + “ J g - (pV»+t«V»/0 dp dq 


13. Calculate the root-mean-square deviation of the energy from the canonical 
mean for an aggregate of N simple harmonic oscillators. 

14 . Evaluate the momentum space volume in eq. (112) directly by the use of 

J /*x 

’ as" -1 * - 'da; and the beta function £(m, n) = 

o 


y * m_i a - x)'~ i dx. 

and Co , 1912 ) 


(Cf. E. B. Wilson, "Advanced Calculus,” p 378 


Ginn 


15. Evaluate - (§ec n) for a system consisting of a single particle with 

AiV(£mp) 

three degrees of freedom where E = 1 01 Emp Do the same for a system of 100 
particles. 

16 . Prove that —i) (Sec, 13) is a maximum for a uniform distribution of the ele- 
ments of an ensemble in phase spare as compared with any other distribution of the 
same number of elements and corresponding to the same value of the average energy. 
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STATISTICAL MECHANICS BY THE METHOD OF 
DARWIN AND FOWLER 

1. FUNDAMENTAL CONCEPTS 

The basic statistical concept in Gibbs’s statistical mechanics is the 
ensemble. This is abandoned in the more recently developed method 
o( Darwin and Fowler 1 and we must understand clearly the significance 
of the change. Gibbs considers a system of / degrees of freedom and 
proceeds to construct an ensemble whose elements are exact copies of 
the system in all its possible phases. On the other hand, the Darwin- 
Fowler method visualizes the system in question as made up of a 
large number of independent constituent systems, e.g., N in number. 
In their notation the N constituents together form an assembly of 
systems. At any instant the state of the assembly depends on the 
states of its constituent systems and quantities representing properties 
of the assembly are averaged over all possible states of the assembly. 
For a concrete example, suppose that a perfect gas consisting of N 
free mass particles is to be described statistically. Each particle will 
be considered a constituent system of the assembly, the latter repre- 
senting the gas as a whole. The state of the gas depends on the state 
of the constituent particles and since each particle is capable of existing 
in various states characterized by different position, momentum, 
energy, etc., the corresponding states of the gas, i.e., the assembly, 
are very varied. 

It is evidently necessary to make clearer what we shall mean by a 
state. Let us suppose that the property in which we are interested is 
the energy. The state of the assembly would be ideally specified by 
stating the precise energy value of each constituent system. Since in 
practical applications the constituent systems are very numerous and 
since in general they are indistinguishable, e.g., all particles of the 
same nature, this mode of specification is impracticable. We there- 
fore fall back on the specification of the number of constituent systems 
in each allowed energy interval. Effectively the situation is like that 
envisaged in Sec. 1 of Chapter IV, where we considered the distribu- 

•R, H. Fowler, “Statistical Mechanics,’’ second edition, 1936. Cambridge 
University Press. The original joint papers date from 1922. 
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tion of N indistinguishable objects among n boxes. The N objects 
constitute the assembly and the n boxes are possible or allowed energy 
values. As far as classical statistics is concerned these energy boxes 
may be of arbitrarily small size and continuously distributed. Accord- 
ing to the quantum mechanical point of view, however, the allowed 
energy values may be discrete, corresponding to a lower limit to the 
size of the boxes. Since we shall shortly have occasion to apply statis- 
tics to problems treated by the quantum theory we shall take from the 
start the general point of view. 

It is well to emphasize that, like the Maxwell-Boltzmann method, 
the Darwin-Fowler statistics in the form presented here operates with 
independent constituent systems. It is only for these that one can 
talk of the number of systems in each allowed energy interval, etc. 
Any attempt to generalize the method to apply to interacting con- 
stituent systems would appear to necessitate the introduction of 
Gibbsian ensembles (cf. Sec. 6 of this chapter). 

As in Chapter IV we shall assume that the individual possible 
energy states or boxes have certain a priori probabilities or elementary 
weights associated with them which we shall designate as g 0 ,g i ■••&•••. 
We generalize the earlier discussion by refraining from setting a limit 
to the number of possible states. In the discussion of quantum 
statistics (Chapter VIII) it will develop that the elementary weights 
of the energy states of a quantum mechanical system are always 
integers. On the other hand, in classical statistics the elementary 
weight associated with the element dp in phase space is dp/h n , where 
h n merely represents the unit volume in phase space and h is Planck’s 
action constant. The inclusion of the divisor h n is to secure the 
necessary non-dimensionality in the weight. (Cf . Sec. 14, Chapter VI.) 

We are now ready to write the expression for the probability or, as 
we shall now call it, the weight to be associated with that state of the 
assembly in which, of the N constituent systems, N 0 are in the energy 
state E 0 , N t in E u ■■■ N, in E„ etc. This follows immediately from 
eq. (6) of Chapter IV with appropriate change in terminology. If 
the weight in question is denoted by IP, we have 


W = 



( 1 ) 


1 


where 


( 2 ) 
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the fixed total number of constituent systems in the assembly, and 

2 E i N s = E, ( 3 ) 

i 


the fixed total energy of the assembly. 

Now we want our statistics to give us the average number of con- 
stituent systems in any particular energy state, say that corresponding 
to E r , subject to the conditions (2)_and (3). Evidently to get this 
number, which we shall denote by N r , we must first multiply N r by 
the weight W and sum over all sets of numbers No, Ni • • • N, • • • 
satisfying conditions (2) and (3). Then we must divide the result by 
the sum of all the IP's consistent with (2) and (3). In abbreviated 
symbolical form 


N r 


2N t W 
2 W ' 


(4) 


To grasp the significance of the method it is essential to understand 
the meaning of the sums in the expression for N r . Going back to (1) 
we see that a value of W corresponds to each mode of distribution of 
the N constituent systems over the energy states. To get we 
must add all these various values of W for all possible modes of dis- 
tribution consistent with (2) and (3). This gives the denominator in 
(4). The numerator is obtained likewise, only before summing we 
multiply each value of W by the N r which is appropriate to that value. 
The remaining problem now is the mathematical evaluation of these 
summations to express the value of N r as a function of the elementary 
weights and the energy E r associated with the rth state. We shall 
discuss this in the next section. 

At this place, however, we ought to notice the difference between 
the method of procedure here and that followed in Chapter IV in the 
treatment of statistical distributions. There we made W (or its equiv- 
alent P„) a maximum subject to conditions equivalent to (2) and (3). 
This resulted in the so-called canonical distribution (eq. (27) of Chap- 
ter IV) ; it was assumed that the number of systems associated with 
a particular energy value in this distribution would correspond to 
the observed number when the system was in a state of equilibrium. 
We then used this number to compute average values, e.g., the total 
energy under various assumptions as to the possible energy values. 
The Darwin-Fowler method proceeds differently; average values, e.g., 
N r , are calculated directly without the necessity for maximizing W. 
We shall find to be sure, that these average values will agree with 
the corresponding distribution formulas of the Maxwell-Boltzmann 
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statistics as well as the averages calculated on the basis of canonical 
ensembles in Gibbs’ statistical mechanics. Nevertheless the variation 
in the mathematical machinery used provides an interesting check on 
the other ways of looking at the problem and certain questionable 
approximations, e.g., the wide use of Stirling’s formula for Nl in the 
analysis of Chapter IV can be avoided. In this way our confidence in 
the statistical point of view will be strengthened. Moreover the 
method of Darwin and Fowler provides a very natural introduction 
to quantum statistics, used for most contemporary problems in statis- 
tical physics. 

2. EVALUATION 07 AVERAGES 


Our next problem is the purely mathematical one of evaluating 
the sums in (4). First consider 




NglNil ■■ 


( 5 ) 


We attack this by means of the multinomial expansion, 
the binomial expansion 


(zo + *)* = 



Nl 

No'.Nil 




First recall 


( 6 ) 


where N 0 + N 1 = N and the sum is taken over all N 0 and Ni satis- 
fying this restriction. By a simple generalization 


(*0 + Zi + z 2 + • • ■ +z r + ■ • • ) N 



Nl 

No'.Nil--- N r l--- 


4° 


( 7 ) 


where the summation is taken over all No, Ni - N r , • • • , etc., satis- 
fying the condition 21V, = N. The connection between (7) and (5) is 
obvious, but we have still to introduce the g’s and the condition 
2IV,£, = £. We do this by taking the expansion 

(goSf®* + giz® 1 + ■ • • + gr^ T + • • -) N 



Nl 

NqI N il Nrl ■■■ 




ZNjBj 


( 8 ) 


If no further restriction is placed on the N } beyond that implied in 
eq. (2) the sum on the right of (8) contains all powers of z given by 
gZNfBj' j£ now we w i s h to restrict the Nj further by the condition (3), 
where £ is a constant, the only terms in the sum on the right of (8) 
which interest us are those in which z is raised to the power £. The 
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sum of the coefficients of all these terms is then our 2W. It follows that 
"LW is equal to the coefficient of z* in the polynomial expansion in 
eq. (8). If we can find some way of evaluating this coefficient we 
shall have found the desired expression for 2W without employing 
Stirling’s formula. 

Consider the contour integral in the complex plane 



where z is the complex variable x + iy = r(cos 6 + * sin 8) and the 
path of integration C is a circle of radius r about the origin. More- 
over n is an integer. This integral may be evaluated by means of real 
integrals by transforming to the equivalent polar coordinates. Thus 
dz = r(— sin 0 + i cos 0)d0, whence 



/'< 


(cos 8 -f i sin 0) n+1 d0. 


By utilizing De Moivre’s theorem 


we have 


(cos 0 + i sin 8) n — cos nO + i sin nO, 


( 9 ) 


But 

jf 


j ' z n dz — trn+l [cos (» -f- 1)8 -f- i sin (n + l)0]d&. 


cos (« + 1 )8d0 


-r* 


sin (» + 1 )0d0 — 0 for n — 1, 


while 


r 

/ cos (« + 1 )8dd = 2?r for n = — 1 and 
Jo 


/ 2w 

Si 


sin (» + 1 )8d8 


= 0 for « = —1. Hence we reach the general conclusion that 


I 


z n dz = 2vi for n — — 1, 
= 0 for n — 1. 


( 10 ) 


It follows that if we integrate (S g,z B ’) N about C and divide by 2 vi we 
shall obtain the coefficient of z -1 in the expansion. This of course 
assumes that all the exponents in the expansion are integral. To 
assure this all we need do is to choose our unit of energy so small that 
effectively SiV,J£, can always be represented to a sufficiently high 
degree of accuracy as an integer. We want, however, the coefficient 
of z B . We must therefore integrate (Zg j z gi ) N /z B+1 about C to get 
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the coefficient of iff* in the expansion (2gyz * *)*. Let us call the sum 
in the parenthesis for convenience, Z, viz., 


Z = Zg,z E \ 


(ID 


We shall follow Darwin and Fowler in referring to this as the “ par- 
tition function.” * In this notation, then 


ZW 


2iri Jc z B+1 


( 12 ) 


We must next find the numerator in the expression for N r On mul- 
tiplying the summand of (5) by N r and then cancelling it in numer- 
ator and denominator, we obtain 


ZN r W 


n ST' (N- ?*' 

^iVoliVx! (N r — 1) ! N r+ i ! • • ’ 


(13) 


where the summation is still to be conducted over all N 0 , N t ■ ■ ■ N r • ■ ■ 
consistent with conditions (2) and (3) Let us now introduce a new 
set of numbers Mo, Mi, • • • M r , • • defined by 


Mo — No, Mi — N u • M r — N r — 1, M r+ i = N r+ i, 
Then ZN r W may be written 


2 N r W = Ng r J2 


(N - D'g^gf 1 g? r 
M 0 l Mi' M r l 


(14) 


where the summation is now conducted over all Mo, • • ■ M r ■ • con- 
sistent with the conditions 2 M } = N — 1 and ZM } E } — E — E r . 
The previous analysis now indicates that 2 N r W is Ng r times the coeffi- 
cient of z B ~ Br in the expansion of (Zg,z E, ) N ~ 1 . Hence 


2 N r W = 


Ng r f Z N -'dz 
2m Jc z E ~ E ’ +l ' 


( 15 ) 


and the average quantity we desire is 


N r 



( 16 ) 


The evaluation of the integrals in the numerator and denominator of 
• In Fowler, op. cit., p. 38, /(*) is used to denote the partition function. 
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N r is carried out by the method of steepest descents, which will now 
be described.* 


We consider the function 
4>(z) = 


Z{z) 2 g,**' 

j^/lf ~ Z SN,E,/N > 


(17) 


which can be written in the alternative form 

, , , (fO + + to*?'-** + • • + grZ* r_ 

= „m/N-l)Ev+Xi/N £ 1 + +Nt/N E t + 


It is clear that we can split <f>(z) into two sets of terms, one a series of 
negative powers of z and the other a series of positive powers. Thus 

j-n «> 

= 22 &/**’ + 2t2 gi ^ % ( ig > 

1-0 »=n + l 

We recall that the g's are all positive integers as are the exponents 
k , and It will be noted that we can always arrange our zero energy 

level so that all the E, are positive and increase monotonically with j. 
Consider the behavior of <£(z) on the real axis. At z = 0, <j>(z) is cer- 
tainly infinite owing to the presence of the first term in (18). At 
z = 1, <t>(z) is again infinite because the second term diverges for z = 1 

PC 

(being ^ ^ g,.) Let us now differentiate <t>(z). We get 


Now as z goes from 0 to 1, both terms on the right side of (19) increase 
monotonically. This is shown diagrammatically in Fig. 7 • 1 where 
A is a schematic representation of the plot of 2 for real values 
of z between 0 and 1, while B represents — 2& ; g,/z* ,+1 in the same 
interval. A schematic plot of 4>{z) is also included. There is only one 
value of z between 0 and 1 for which d<t>(z)/dz — 0, and here <j>(z) 
will have a minimum. Let us call this value z = f. Going back to 


(17) we have then 


(dZ/dz EZ/N\ 

\ 7 ?'" z^+V.-r - 


or 


„ m{dz/dz) t _di og z(r) 

£ — m~ ■ Nl ~7T~ 


( 20 ) 


which acts as a defining equation for f. 


1 The method was apparently first described by P Debye, Moth Ann., 67, 535 
( 1909) ; Munch. SUzungsbenchte, 40 (1910). See also E. T. Copson, “Theory of 
Functions of a Complex Variable," p. 330. Oxford, 1935. 
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Now consider the circle in the complex plane with center at the 
origin passing through the point z = f . This circle is represented by 
the equation z = ie la , where a is the angle about the origin measured 
from the real axis. We seek the behavior of [^(z)]^ = [Z(z)] N /z B on 
this circle, where 


[ 2 ( 2)1 


N _ 




( 21 ) 


If we write out the*sum in the bracket in (21) it will have both a real 
and imaginary part. The absolute value of this resulting complex 
expression becomes 

I 2(z) | = VZgjf*' + cos a (£, - E k ). 

For a 0, | Z(z) | is less than its value for a = 0, unless for all the 
values of E, there exists the relation 

a(E, — Ek) = 2ir« (22) 

for all j and k, n being any integer. Consequently if the condition 
(22) is not fulfilled, the abso- 
lute value of Z(z) will be 
greater at z = f than for any 
other point on the circle 
z = fe*“. Therefore if N is 
large, [Zfz)]^ will have a 
strong maximum at this 
point. For this reason z = f 
is known as a “ saddle ” 
point, since whereas it corre- 
sponds to a minimum on the 
real axis, as we go away in 
either direction from the real 
axis the function falls away 
very steeply, the steepness 
increasing as N increases. We 
shall suppose that N is so 
large that the value of [Z(z)] Ar 
and hence the value of [<£(z)] N 
at any point on the circle save 
the saddle point is negligible 
If this circle is chosen for the 

contour C, the value of the integral in the numerator of (16) becomes 
effectively 

„ r(gS'\z N dz ^ r.z* 



Fig 7-1 


B+V 


■dz, 


(23) 
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since the value of z Er /Z will have no appreciable effect on the integral 
save for z = f. This enables us to express the average value N r at 
oftce in the form 



Ngr r* 

2(f) ’ 


(24) 


without the necessity of calculating 


L 


rtr 


dz. It should be noted 


that strictly speaking what we have said above about the saddle 
point corresponding to much larger values of [Z^z)] 1 * on the real axis 
than for points just off the axis also applies as well to other points on 
the real axis for 0 < z < 1. However from function theory it follows 
that the result of the integration is independent of the precise contour 
around the origin. Hence we are at liberty to pick that contour for 
which the integration is simplest. This proves to be the case for the 
contour passing through the saddle point. It is usually assumed that 
the descent from the saddle point is steepest (cf. Fowler, op. cit., 
p. 36) and indeed this seems qualitatively to be the case, though we 
shall not endeavor to give a proof of it here. The steepness will 
naturally improve the accuracy of the evaluation of N r by means of 
contour integration. 

We have still to examine what happens when eq. (22) is satisfied. 
It will be noted that this will be true only if all the differences E, — E k 
have an integral common factor, say l. For then (22) will hold for 
a/2ir — 1 / 2 , 2/1 ■ • ■ (l — 1 )//. Consequently instead of only one 
maximum point on the contour circle there will be l such points. 
The result is to multiply the integral in (23) by Z, but since numerator 
and denominator are multiplied by the same quantity, the value of 
N r given in (24) is not affected. 

The contour integral for 2PF will be of importance in the subse- 
quent analysis and we shall therefore evaluate it here. Let us expand 
log [*(*)]* in a Taylor series about the point z = f and along the 
circular contour. We have 


log [*(*)]" = N log <t>(z) = 


w {^* (r)+(l _ r) [i!^] i+ &zp!.[^] t+ 


(25) 


But at z = f , <j>(z) has its minimum and therefore the coefficient of 
z — f vanishes. Moreover if e — f is small enough we can replace it 
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by iaf to a sufficiently good approximation. Hence we have 


log 0(z>r - N jlog 0(f) - fa 2 / 0(f) + 


d 2 <KS) 


or 


m] N - [*(f)]*e >’^> 


/*(*-) + •••} 

(26) 


Incidentally this serves to exemplify the large rate of decrease of 
Ofz)]* for N very large as one passes from z = f to some nearby 
value. 

We now have, writing dz = iz da, 


2irt 



[»(f)F r + ° 

2r X. c 


da, 


(27) 


where we have availed ourselves of the possibility of conducting the 
integration about any convenient path which passes through z = f 
(where a = 0) since the integrand is extremely small save at this 
point. The limits are chosen as -j- » and — w for simplicity also. 
The result of the integration is 



Let us return to the formula (24) which gives the average number 
of systems in the assembly in thejmergy state E r . It is the distribu- 
tion law. Evidently if we sum N r over all r we must get N. This 
condition indeed is satisfied by (24) since 



(29) 


The presence of the parameter f and the partition function Z(f) makes 
the distribution law appear a little strange, especially as it was said 
above that the Darwin-Fowler method leads to essentially the same 
result as the canonical distribution in the M axwell- Bol tzmann statis- 
tics. However we recall that the latter involves the parameter © 
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which we have found it necessary to interpret as kT in order to estab- 
lish connection with experimental results. If we now set 

r = e~ 119 = e~ 1/kT , (30) 


(24) becomes 


N r 


Ng r e~ E ' /kT 
2 gr C- E ' /kT ' 


(31) 


in precise agreement with the canonical distribution (27) or (27') of 
Chapter IV (making allowance for changes in notation). We shall 
shortly present independent reasons for making the assumption (30). 
To do this we shall find it necessary to discuss assemblies consisting 
of more than one kind of system. 


3. ASSEMBLY OF TWO KINDS OF SYSTEMS 

In place of an assembly consisting of systems all of which are of 
the same type, let us imagine an assembly in which there are two 
kinds of systems. The analysis can be readily generalized to any 
number of types, but we shall find it convenient for the sake of sim- 
plicity of notation to confine our attention to two. 

Let now the number of systems of the two kinds be Ni and N 2 
respectively, where 

Nt+N 2 = N. (32) 

Of the Ni systems of the first type, let Nio, Nu, N 12 , ■ • ■, Ni r , • • • be 
the numbers in the energy states E\ o, E\\, •••, E\ r , • * • respectively. 
Of the N 2 systems of the second type, let iV 2 o, N 2 u N 22 , ■ • , N 2r , 
reside in the energy states E 2 0 , £ 2 i» • , E 2r , ■ • • respectively. Note 

the necessary change in notation from Sec. 2 in order to denote 
adequately the two types of systems. The a priori weights attached 
to the energy states of the first type of system are now gio, gn, gu< • • • , 
girt • • • and those for the second type of system g 2 o, g 2 i, • • • , g 2r , • • • . 

Clearly we have the relations 

= Nii ^ ' n 2j = N 2 ; 

i i 

(33) 

'y^jNijEi, + ]) ' N 2 } E 2j = E. 

I j 

Here E is the total energy of the assembly. The eqs. (33) replace 
the two equations (2) and (3). 

Our first task is to express the weight to be associated with the 
state of the assembly in which N\q systems of the first type are in 



Sec. 3] 


ASSEMBLY OF TWO KINDS OF SYSTEMS 


165 


the energy state Eiq, Nn in En, •••, iV a o in £ 20 . etc. From the 
analysis of Sec. 1 this weight is expressible in the form 


j 1 # 

n> ! 


The average number of systems of the first kind in the energy state 
Ei r is then JV lr and its value is given by 


The procedure for the evaluation of N ir follows that employed in 
Sec. 2 Thus we use the polynomial expansions. 


3 

*.n* 

Vg 2] 2?") N ' = Y, ~TT 

^ 11^;! 

3 

The product of (36) and (37) is 

(2gI } 2*0* l (2g2 J ^ J^,I )* , - 

V'' l . l 

^ n^ ! n^ ! 


2AT; ij Bij 




. zZWijXii+NttXt,)' 


By precisely the same reasoning as in Sec. 2, it follows that the denom- 
inator in the expression for N r , viz., 2 W, is the coefficient of z B in 
the expansion (38). Moreover the numerator can be handled in simi- 
lar fashion. Let us write 


The result is that 


2*i, = Zf. 2g 2 ,z*" = Z 2 


f z Eir . £ 

Jc Z x 

Nlr Tz^zF 

Jc ** +1 


^ dz 

Z _£ + l aZ 

z 


(40) 
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The application of the method of steepest descents with 


yields finally 


[*<*)]" = JL ?r - . 


where f is the value of z which makes 4>(z) a minimum on the real 
axis. Similarly 

iV7r = -%g2r^, (42') 

Z2(i) 

with the same f, since there is again only one minimum for <h(r) on 
the real axis between 0 and 1. Since 


= 0 for 3 = f , 


we get the relation 


E = Ntfy log Z, + l\r 2 f ~ log Z a . (43) 

®f of 

Each term on the right of (43) is of the same form as the expression 
for' 22 in an assembly of systems all of the same kind, viz., eq. (20). 
This leads to the expectation that each term in (43) may be looked 
upon as the average energy of the type of system it represents. This 
can be verified as follows. From the definition of average we have 

_ ZW(ZN U E X] ) 

£ ‘ = 2W~ ■ (44) 

The evaluation of the numerator is carried out by differentiating both 
sides of eq. (36) with respect to z and multiplying the result by z. 
Thus 

»T I I I /W T, \ 2>«*1 


- 

az 


'"■ L ' ri^ 


* 

Multiply further by Z% x and get 

zi‘ — 1 w — tt • 
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The coefficient of in the expansion on the right is precisely the 
numerator of E\. Hence we again apply the contour integration and 
have 


Ex = 


£z%'z~Z?'/z E +'-dz 

£z? 1 Z$ l /z E+l -dz 


£dz/z B+1 • NxZ^Z?' z ~ log Zx 


+1 


dz 


(47) 


The method of steepest descents applied to (47) then yields 


Ei = Nxt-\ogZx(£) t 


(48) 


which provides the confirmation of our surmise above that the total 
energy is the sum of the average energies of the component assemblies 
of different kinds of systems. 

We are now ready to give some reasons for the association of f with 
the absolute temperature of the assembly as assumed in eq. (30), viz., 
f = e~ 1/kT . Suppose we consider first an assembly consisting of Nx 
systems of only one kind. The average number of such systems in the 
rth energy state is then given by (42) (with f = fi) 


N lr 

where fi is given by 

Ex 


Nx 

ZxiSx) 


Sir 


t-Bv 
SI * 


Niti -jr log Zx, 

dfi 


(49) 

(50) 


the condition that fi shall provide the minimum value of <t>x(z) = 
Zi(z)/z^ l/Ni along the real axis. Consider next an assembly of N% 
systems of a second kind in which the corresponding average distribu- 
tion is given by 

N * r = Z 2 (£ 2 ) g2r (51) 

with fa corresponding to the minimum value of 4>a( z ) = Zi(z)/^ Nl 
on the real axis. Note, of course, that in general it is unnecessary 
that f i = f 2 . However if the two assemblies are brought together and 
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form a single assembly in equilibrium, the values Ni r and N 2t above 
given will necessarily be special cases of 


N ar = 


Nag* rf* 

z a <£) ' 


(52) 


- in which a = 1 and 2 respectively for the two types of systems and 
f is one constant which is the same for both as long as they are in 
equilibrium. Hence when two assemblies in contact are in equi- 
librium, they must have associated with them the same value of f. 
This at once suggests the possible connection of £ with the tempera- 
ture, since thermodynamically speaking it is the temperature which 
is the same for thermodynamic systems in equilibrium. It must also 
be remarked that from definition £ must be a positive quantity. 

We should be more specific. Consider an assembly of linear simple 
harmonic oscillators. We shall suppose there are two types, one with 
frequency v\ and the other with frequency v 2 . Let the numbers of 
the two types be Ni and N 2 respectively. Now the study of quantum 
mechanics reveals (cf. Chapter VIII) that the possible energy states 
of a harmonic oscillator do not form a continuous series but are dis- 
cretely distributed. In fact we have 


— ( j + -%)hvi ; E 2 j — (J + $)hv 2 . (53) 


At the same time the a priori weight factors are all unity. Thus 


gij = in = 1. for all j. 

Consequently the partition function Zi(f) becomes 

Zx($) =2^' = f W2 X/ 4 ' 1 - 

) ) 

Since ^ = 1/(1 — £ kn ) if j runs to infinity, we have 

2 


and similarly 


Zi(f) = 
ZiiS) = 


1 ’ 

£ w 2 


f.hn 


(54) 


(55) 

(56) 


The fundamental distribution formula (52) with the appropriate sub- 
stitutions then yields for the average number of oscillators of the two 
types in the rth energy state, 

JvT r = N t (1 - { kn )f kri , 

N^r = A 2 (l - £*'*)£ rfc -. 


( 57 ) 
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The average energy values are given by substitution into eq. (50) 
with = f. The result is 



E 2 - N 2 (~ + — k h * 2 _ • (59) 


Now Ei includes both the kinetic and potential energies, which in 
classical mechanics are each equal on the average to kT/2 per particle 
(equipartition of energy). When v x and v 2 approach zero we should 
expect the results of quantum theory to approach those of classical 
theory. Hence in the limit 


Ei E 2 

lim — = ft 7 = hm 

»l -. 0 -W 1 H 0 ■” 2 

(60) 

Now from (58) and (59) 

.. Ei 1 

lim — = — ; 

»l — * 0 -Wl logf 

This follows from 

(61) 

f Ari = 1 - fo-ilogr + .... 

(62) 

By comparison of (60) with (61) we get 


r = e~ l/kT , 

(63) 

as the indicated expression for £. In the next section we shall give 
still another demonstration of this relation. For the moment let us 
note that the above discussion need not be confined to a linear har- 
monic oscillator; the two- and three-dimensional cases are also easily 
handled. The reader may show that for the two-dimensional oscil- 
lator, for which E] = {j + \)hv from quantum mechanics and g, = 
j + 1, the partition function becomes 

Z tf) ~ ^ * 

(64) 

and the average energy becomes 


/ 2 hv \ 

E-N[hv + rh ,_ J- 

(65) 


Here we have for convenience omitted the subscript denoting the type 
of system in question. This leads to precisely the same connection 
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between f and T that we found in the />ne-dimensional case, 
now 



2 

logf’ 


For 


' but since the oscillator has two degrees of freedom, in the limit of 
vanishing frequency its energy becomes 2kT, and (63) is again 
obtained. 

For the three-dimensional oscillator, quantum mechanical reason- 
ing 4 indicates E, = (j + 3/2)hv and g } — 1/2 (J + 1 )(j + 2). The 
result for the average energy is 



and again (63) is found to be satisfied. 

The reader will find some interest in comparing the average energy 
for each type of harmonic oscillator obtained in the present discussion 
with Jhe canonical distribution formulas in Chapter IV. Thus express- 
ing E in (65) in terms of T, the two-dimensional oscillator average 
energy is 

E = N(hv + j£ J^), (67) 


which is precisely the form of eq. (47) in Chapter IV with 0 = kT. 
It must be realized, of course, that E in (47) corresponds to our E/N 
in (67). The physical significance of average energy expressions like 
(67) will appear more clearly when we study the quantum statistics of 
radiation in the next chapter. 


4. STATISTICS OF AN IDEAL GAS 

Consider an assembly of N particles each of mass m and with 
negligible mutual interaction forces. The particles are confined in a 
vessel of volume t. This can be represented symbolically by assuming 
that the potential energy for any particle is V(x, y, z) = 0 every- 
where inside the vessel but rises abruptly to infinity at the walls 
and maintains this value everywhere outside, implying that no par- 
ticle is able to escape from the vessel. In applying the Darwin- 
Fowler method to such an assembly the essential matter is the evalua- 
tion of the partition function Z (eq. 11). The energy values E } no 

4 Cf. for example, L. Pauling and E. B Wilson, "Introduction to Quantum 
Mechanics," McGraw-Hill, New York, 1935, p. 100. 



Sec. 4] 


STATISTICS OF AN IDEAL GAS 


171 


longer form a discrete set but are continuously distributed. We have 
indeed 

E “ i (/> - + + y< *)• (68) 

If we suppose that the phase space of the assembly is divided into 
cells with phase volume A ]t where 

A) = A Px: A Pvi A P» l Ax ) Ayj A Zj, (69) 

and assign one set of momentum and coordinate values to each cell, 
we can write 

{p * + p2> + p2,]) + V(X) ’ yi> Zj) - (70) 
From Sec. 1 the elementary weights are given by 

& = fr m 


The partition function for this cell distribution becomes 
Z' = — . 2A. 

2.3 " 


(72) 


As the cells decrease in size and increase in number the sum above 
goes into an integral over the phase space, viz , 

Z = y 3 ffffff z(p * +P ' +p ‘ )/2m+r<z ' * J) dp x dp v dp. dx dy dz, (73) 


where, of course, it is essential not to confuse the z which is the basic 
independent variable in Z with the space coordinate z. The volume 
integration is to be conducted over the whole physical volume r and 
the momentum integration from — oo to +co for each component. 
This expression can be materially simplified by noting that 



v< *' »« *> dx dy dz = r, 


(74) 


since V = 0 everywhere inside the vessel. Therefore 
Z = ~ fff z^ +p l + fi /2n dp x dp v dp,. 


If we utilize the fact that 

fcx» _ -6*»logl/a 

a =6 , 


( 75 ) 
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we can express the partition functioa in the form 
+ 00 

Z - ^ fff e ~ ****-- 1/1 dpx dp y dp., (76) 

— « 

which at once yields 

_ t / 2irm V 4 . 

z - h >- W • <77) 

In the interpretation of this expression it must be recalled that z < 1. 
We can now use the fundamental formula (24) to obtain the average 
number of particles in the phase element dp^dp^dp. dx dy dz. This 
number is AN, where 

N . d P x dpy dpi dx dy dz f(p | +p * +p J )/2m 

AN - t—rr — (78) 


r / 2 win V 4 

A 5 Viog l/t) 


Therefore the probability of finding a particle within this phase ele- 
ment is 

P = ~ = ■ t lpl+P *' + * )/2m -dp x dp v dp, dx dy dz. (79) 


The probability that a particle shall have its momentum components 
included in the interval p x , p x + dp x , with no restriction on its posi- 
tion in the vessel, etc , is then 

P' = fff Pdxdydz = ^l+v\+v\)/2» dpx dpu dpf m 

Placing p x = mv x , etc. and f = e~ 1/kT from (63), we obtain P' in the 
form of the Maxwellian velocity distribution already discussed in 
Chapter V (cf . eq. 5 1 of that chapter) and again by the Gibbs method 
in Sec. 10 of Chapter VI, viz., 

P' = e- m ^ +, l + ^ /2kT dv x dv v dv,. (81) 

\ 27 xkT / 


Again we see that the connection between f and T in the form 
£ = e~ 1/kT is definitely indicated. Still another way of looking at the 
same matter is provided by the classical equipartition principle, which 
in the present instance takes the form 


E = 


3 NkT 


(82) 


2 
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Let us calculate E on the Darwin-Fowler method and see what form 
t must have to satisfy the equipartition principle. We have 

E = log Z(r). (83) 

If the reader substitutes for Z the value given in (77) and performs 
the indicated operations he will come out with 



Equating this to 3/2 NkT gives again 

f = e~ 1/kT , (85) 

which completes our validation of the connection between the Darwin- 
Fowler parameter f and the absolute temperature. 

The reader will probably have noted the close connection between 
the partition function (77) and the so-called distribution function 
obtained in Sec. 4, Chapter IV (eq. 83) for an ideal gas on the Maxwell- 
Boltzmann statistics. By allowing for the difference in notation for 
the physical volume the two expressions are in complete agreement. 
It is true that the general defining expression for Z in Maxwell- 
Boltzmann statistics in eq. (53), Chapter IV, appears not to agree 
with eq. (11) of the present chapter. The reason is to be found in 
the different definitions of the g, in the two methods. In the Maxwell- 
Boltzmann statistics the g, are genuine mathematical a priori proba- 
bilities which are proper fractions, while in the Darwin-Fowler method 
the gj are elementary weights which are integers. This explains the 
appearance of the factor p in the earlier definition of Z. Actually 
there is complete agreement between the two points of view and 
further evidence of this will appear as we proceed. 

S. THE CONCEPT OF ENTROPY 

We must now see how the idea of entropy fits into the Darwin- 
Fowler statistical method. Somewhere in the theory we must find a 
quantity which for an isolated system tends to increase. This quan- 
tity must finally enter into equations which are formally equivalent 
to the relations of thermodynamics. 

The reader will recall from Chapter VI (Sec. 13) that Gibbs intro- 
duced — kv = k(E - p)/& as the statistical mechanical analogue of 
entropy. This quantity was found to possess the necessary qualifica- 
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tions stated in the preceding paragraph. It has indeed the interesting 
property that it can be defined for non-canonical as well as canonical 
distributions and hence applies to both equilibrium and non-equilibrium 
states. In a certain sense therefore it overpredtcts experience, since in 
thermodynamics the entropy is defined for equilibrium states only. 
Since the Darwin -Fowler method confines itself to the calculation of 
averages over actual assemblies of systems, where these averages are 
then interpreted as the measured values of properties of the assemblies 
in states of equilibrium, we should expect that the Darwin-Fowler 
definition of entropy will apply only to equilibrium configurations. 
This has the possible advantage that it does not transcend experience 
like the Gibbs theory. On the other hand it has the disadvantage 
that no matter what precise definition is chosen, we cannot hope to 
show that the value of the entropy increases with the passage of time. 
All we can hope to do is to show that there is an increase in entropy 
when two assemblies in equilibrium are combined to form a new 
assembly. 

In view of the close fundamental connection between the Darwin- 
Fowler method and the classical Maxwell-Boltzmann statistics we 
expect that the definition of entropy in the former will follow the 
example set by the latter. It is natural to replace the “ statistical 
probability " w (eq. 49, Chapter IV) by the expression 2 W, the sum 
of weights entering into the denominator of eq. (4) of the present 
chapter. As before we shall divide 2 IF by N\ and finally define 

5 = k log 2IF - k log N !, (86) 


where k is, as usual, the Boltzmann gas constant. The analytical 
expression for 2 IF in terms of a contour integral has already been 
given in eq. (12), Sec. 2, and the value in terms of the parameters 
of the assembly has been calculated in eq. (28), which we set down 
here again for reference 


where 


zw = [z(f)] w rV0(f)/2^V'(f), 


*(T) = 


ZiX) 

^B/N > 


and denotes the second derivative with respect to f. If we 

take the logarithm of 2 IF, we obtain 


log 2 IF = N log Z(£) -E log f + i log Wt)/2rNp+"(t)]. (87) 
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Let us look into the magnitude of the last term. Substitution of the 
expression for $(f) given above yields for this term 

$ log Z(f) — 1°K f — i l°g 2iriV — log f + ^ log f 

- * log [z"(r) + 1 (| + 1 ) zr 2 - “ zr 1 ]. m 

The first, second, fourth, and fifth terms in (88) are very small com- 
pared with the first two terms in (87). Now we write further 

z'(r) = sEjfor*’- 1 ; z"(r) = ze ^ e , - i) & f*»- a 

whence the bracket term in (88) becomes 

- i log r* + f 0 + |) z - 0 + f ) SW']. 

We can further write 

safer** = £% 

where I? is a kind of mean-square energy value, while 

S E } g,!* = £Z, 

where E = E/N. Consequently the bracket term becomes 
- £ log \E? - £ 2 ]Z + logf. 

Now certainly E? — E 2 is of the order of magnitude of £ 3 or smaller. 
In neglectingjthe bracket we shall then be neglecting terms of the 
order of log E and log Z which are small compared with N log Z. 
The upshot is that, if we disregard the constant term — log 2ir N 
(which since it is a constant will play no role in entropy changes and 
which in any case is small compared with the terms retained as long 
as N is large), we can get a very good approximation to log ZW by 
retaining only the first two terms in (87) and writing therefore 

log 2 W = N log Z(f) - E log f. (89) 

This should be compared with (63) in Chapter IV. 

Let us suppose that we have two assemblies composed of different 
types of systems and imagine that the two assemblies are joined to 
form a single one. From Sec. 3 we are justified in replacing the par- 
tition function Z by Zi l/N Zji i/N , where Zt and Z a are the partition 
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functions for the individual assemblies before the combination. Hence 
log 2W now becomes 

log 2W = log Zx(f) + N a log Z 2 (r) - £ log r. (90) 
Here, of course, we also have 


E = Ei + £2, 


where £1 and £3 represent the energies of the individual, isolated 
assemblies. 

Before combination the entropies of the individual assemblies are 
respectively 


51 = k[Ni log Zi(fi) - £1 log - log Ni !], 

5 2 = k[N 2 log Z 2 (f 2 ) - £ 2 logf 2 - log iV 2 !], 


(91) 


where we have used different values of f since the temperatures of the 
assemblies need not be the same. Let us find the condition laid on fi 
in order that Si shall be a minimum. This is clearly 


d-Si _ , F ffii dZ|(fi) _ £i l _ 

dfi Lzi(f,) dil ft J " 

which yields the energy value 

dZi(fi) 


(92) 


(93) 


which is just the energy value for equilibrium, as already determined 
previously in eqs. (20) and (48). Hence S\ has a stationary value for 
the value of f for which the first assembly is in equilibrium with total 
energy £1. That this stationary value is a minimum is clear from the 
form of Si. In fact (91) allows us to write 


Si - k log 


[Zifo)]* 

Ni'.Si*' 


= k log 


[Xh )]" 1 

Nil 


(94) 


But we know from the method of steepest descents in Sec. 2 that 4>(z) 
has a minimum on the real axis for the value z = ft. This assures the 
minimum property of Si. In similar fashion we can show that S a 
has a minimum for the value of £ for which the second assembly is in 
equilibrium with total energy £3. 

Now let the two assemblies be joined to form a single assembly and 
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let t correspond to the temperature of the combined assembly. We 
then have for the entropy 

5 = k[N t log Zi(rt + N 2 log Z 2 (r) 

— (Ex + E 2 ) log f — log Ni ! — log N 2 !]. (95) 

This can be written as the sum of Si (£) and S 2 (£), i e., the sum of the 
entropies of the two isolated assemblies at the same temperature, 
where, of course 

Si(£) = KNi log Zite) — Ei log f - log N x !] 

(96) 

Sate) = k[N 2 log Z 2 te) ~ E 2 log f - log N 2 

Now we have just shown that Site) has a minimum for f = f x corre- 
sponding to (93) and S 2 te) a minimum for f = f 2 . Therefore unless 
ti ~ ta = £ it follows that 

-SiG-i) S 2 te2)<S 2 te). (97) 

Consequently except for this special case 

Ste) = Site) + Sate) > Sxtei) + Satea). (98) 

This shows that the entropy after the combination of the two assem- 
blies is greater than the sum of the individual entropies previously. Of 
course, if the two assemblies are already in equilibrium at the same 
temperature, the total entropy is unchanged, in agreement with the 
usual thermodynamical result. 

The final step is to show that the Darwin-Fowler entropy S = 
k log 2 W — k log N\ satisfies the fundamental thermodynamical rela- 
tion (the first law) 

dE + AA = TdS, (99) 

where we here temporarily denote the element of work done by the 
assembly by AA to avoid confusion in notation. 

For simplicity let us assume that the assembly consists of systems 
of only one kind. The possible energy states E, will be functions of n 
external parameters which we shall denote by fj ■■•(«, e g , the vol- 
ume of the region occupied by the assembly. If no external influence 
is brought to bear on the assembly, the {’s remain unaltered and with 
them the E, values. Since the partition function Z depends on the 
energy states, it likewise is a function of the (’s. Thus the total energy 
is given by 

„ NtdZte,fi,---ti 
e = T 4 


( 100 ) 
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and the change in energy associated with change in the temperature 
parameter f and changes db, • • in the external parameters can 
be conveniently written in the form 


dE 


+«E 


logZ /*rl 


i - 1 


a 2 log z 
Mb 


dt,- 


( 101 ) 


BEi. 


We calculate AA by noting that is the force associated with the 

BE t 

change in E % due to unit change in Thus db is the work done 

by a system of the assembly in the state E, when b changes by 
The total work done by such a system when the changes db, db, ■ • -d£ n 
take place is then 

-S3* 


Now on the average there are 


N 

’ Z( f) 


( 102 ) 


systems in the assembly in the energy state E t . Hence the total con- 
tribution of these to the work done is 


W* y 9 E, 
Z(S) 3b 


(103) 


and the work done by the systems in all states becomes 
N V' e V"' BE t , 

za) 2 ’ S ir. ~ aa - 


BE X 

frf db 

Now from the definition of the partition function it follows that 


(104) 


to _ 

T.- 


log Z(f, fi, • • • b)db 

1 v 5 ^ BZ 1 v'' e , 3E X , 

(105) 
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Hence by comparison with (104) we obtain 


Therefore 
AA + dE 




I . 9 2 Z V' d ‘ l0K Z rr 


(106) 


L ar 


i a 2 log z 


(107) 


Now let us go back to the Darwin-Fowler entropy definition (86) and 
write 

5 = kN log Z — i kNf log f 3 — — k log Nl. (108) 

®f 


The change in S corresponding to changes d£ and d( j, • • • then 
becomes 


dS = - ANlogf ^|pdf 

- an dt - km log f # 
of of 

a log z a 2 log z 

- ‘"g it *• - kN! ' IJ-w' *■ 

Reduction of (109) and comparison with (107) yields finally 

dS 


k logf 


— AA -{- dE. 


(109) 


( 110 ) 


But since log f = — 1/kT, this is equivalent to (99), further validating 
the Darwin-Fowler definition of entropy. 

A simple illustration of the preceding considerations is provided by 
the ideal gas of Sec. 4. The substitution of the partition function (77) 
into the entropy expression (108) immediately leads to 

S = kN log ^ ^ (2irmkT)** J + — + kN, (111) 


which is identical with (88) of Chapter IV. If we form the total differ- 
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ential, treating t, the volume, as the sole external parameter, we 
obtain 

I IT TJI 

TdS = — - dr + I kNdT - %dT 4- dJE. (112) 

T I 

But we recall from (82) that E, which represents the average total 

3NkT 

energy, for an ideal gas is — - — and from this 

kNT 

TdS = - — dr + dE. (113) 

T 

From the equation of state of the ideal gas, however, pr — NkT ; 
therefore kNT/r = p and pdr — dA and Eq. (113) becomes equiva- 
lent to (99). 

6. THE PARTITION FUNCTION AND GIBBS’S PHASE INTEGRAL 

The reader will have observed the close connection between the 
analysis in the preceding section and that in Sec. 3, Chapter IV. There 
we were still using w for the statistical probability but the analogy 
between this and 2 IF is very close. The question arises as to the 
connection between the Darwin-Fowler partition function and entropy 
and Gibbs’s statistical mechanics. In the first place we note the inter- 
esting mathematical similarity between the partition function 

Z = Zg } t~* i/kT 
and the Gibbs phase integral 

I = e -* /B =/ e~ H ' B d4>. 

In fact for a system composed of free particles, in evaluating Z we 
actually effectively computed I. It is well to note, however, that the 
two differ in their logical basis. In the Gibbs phase integral H is the 
Hamiltonian function for a dynamical system described by an ensemble 
of elements distributed throughout phase space. The various values of 
H for different parts of phase space are not different values of the 
energy of a particular component system; they refer to the fictitious 
elements of the ensemble. In the partition function, on the other hand, 
the Ej are the possible energy values of a component system forming a 
constituent of a whole assembly of systems. We recall that it is the 
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assembly of component systems in the Darwin-Fowler method which 
corresponds to the actual system described by the ensemble in the 
Gibbs method. Hence we expect that the formal analogy between I 
and Z will be of validity only for systems which can be described by a 
Gibbsian ensemble constructed for a single constituent of the system 
without bothering to construct an ensemble for the whole system. We 
found (cf . Sec. 10, Chapter VI) this to be possible for a system of free par- 
ticles. For a system of interacting particles, however, this procedure 
could not be carried out and here the analogy between I and Z could 
not be logically maintained. This naturally will not prevent the replace- 
ment of the summation in Z by an equivalent integration whenever this 
proves to be mathematically more convenient. Moreover it is quite 
conceivable that the simple Darwin-Fowler method may be general- 
ized to deal with systems in which the constituents interact with each 
other and for which in the partition function the E, will refer to possible 
energy values of the whole assembly. As a matter of fact Darwin and 
Fowler do use such partition functions occasionally. Their theoretical 
justification will presumably rest ultimately on an appeal to the 
Gibbsian ensemble concept. 

In the next chapter we shall consider in detail the modification 
introduced into statistics by the advent of quantum mechanics. There 
we shall find it convenient to use as a framework the method of 
Darwin-Fowler or its equivalent, the classical method of Chapter IV. 
This must not be interpreted to mean, however, that it is impossible 
to develop quantum statistics by a generalization of the method of 
Gibbs. The close connection between the partition function and 
the Gibbs phase integral which we have just stressed suggests 
the possibility of translating the Gibbs statistical mechanics into the 
quantum mechanical terminology. This will involve the replacement 
of the concept of the motion of ensembles in phase space by that of the 
existence of definite and often discrete quantum states which alone 
specify the possible motions of physical systems. Integration of 
quantities over phase space will be replaced by summations over the 
discrete quantum states. In certain cases, e.g , at high temperatures, 
the quantum states may be crowded together so closely that for prac- 
tical purposes the summations may be replaced by integrals. In these 
limiting cases one therefore expects that quantum statistics will lead to 
the same result as classical statistics. 5 

•For further discussion of this point, cf Mayer and Mayer, "Statistical Me- 
chanics," pp. 123 ff, 218 ff, 240. (John Wiley & Sons, 1940.) 
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PROBLEMS 

1. Consider an assembly of N systems in which each system is a rotator about a 
fixed axis. Let there be » types of rotators with frequencies ri, *)•••*« and the 
numbers of each type Ni- • N n . Use the energy values given by the classical Bohr 
quantum theory. Find Njr and £y. 

2. Solve Problem 1 for the case in which the rotators have two degrees of freedom. 
For the energy values, cf. Lindsay and Margenau, “ Foundations of Physics,” p. 433. 

3. Find the partition function and the average energy for an assembly of N two- 
dimensional harmonic oscillators. 

4 . Solve Problem 3 for the case of N three-dimensional oscillators. 

5. Find the expression for the entropy of an assembly of N simple harmonic 
oscillators of the same frequency in terms of the total energy and the temperature. 
Solve the same problem for an assembly composed of n types of oscillators with 
frequencies v\, ** • • y„. 

6. Find the expression for the specific heat at constant volume for an assembly 
of N simple harmonic oscillators of the same frequency 

7. Find the expression for the specific heat at constant volume for an assembly 

of N simple fixed axis rotators with frequencies »j, • p n . (Cf Problem 1 ) 



CHAPTER VIII 


FUNDAMENTALS OF QUANTUM STATISTICS 

1. REVIEW OF QUANTUM MECHANICS 

The Darwin-Fowler method of developing statistical mechanics is 
well adapted to handle the modification in classical statistics brought 
about by the introduction of the quantum theory. Before we embark 
on the description of quantum statistics, however, it will be desirable 
to review briefly the fundamental principles of quantum mechanics . 1 

Quantum mechanics like its classical prototype deals with physical 
systems described by means of coordinates and conjugate momenta: 
we shall still be dealing with the q } and p, and the number of degrees of 
freedom of the system under discussion will still be denoted by/. The 
various properties of the system such as its total momentum, energy, 
angular momentum, are called observables, and it is the task of 
quantum mechanics to predict the allowed numerical values of these 
observables as well as their average values. In the first part of this 
program it differs decidedly from classical mechanics since in the latter 
all real values of observables are possible. In the second part of the 
program it reminds us of the fundamental problem of statistical 
mechanics. But the concept of the state of a physical system in quan- 
tum mechanics is very different from that in classical mechanics. In 
classical mechanics we know the state of a system if we have given 
the instantaneous values of the g’s and the p’s which characterize it, 
in other words, its phase. In quantum mechanics, on the other hand, 
the state is characterized by a certain function of the coordinates, 
known usually as a state function or <f> function , 2 the only restrictions 

1 A more extensive survey, well adapted to the purposes of the present work, 
will be found in Chapter IX of Lindsay and Margenau’s “Foundations of Physics.” 
For the professional treatises on the subject the reader may consult Dirac’s 
“Principles of Quantum Mechanics,” Oxford Uruv Press, second edition, 1935; or, 
as more suitable for the general reader, Kemble’s "Fundamental Principles of 
Quantum Mechanics,” McGraw-Hill, 1937. 

*The <t> function is also often called a “wave” function The reader must be 
careful to distinguish between the state function of quantum mechanics and the 
state variables of thermodynamics (Chapter III). 
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on which are that it must be single-valued and quadratically inte- 

grable, i.e., J' p*$dr exists, where <t>* is the complex conjugate of <f> and 

dr is the element of volume in configuration space, that is, the space 
of the q's. This purely abstract characterization must, of course, be 
supplemented by the statement that quantum mechanics provides a 
way of assigning proper state functions to systems and ways of using 
the <t> functions to calculate possible and average values of observables. 

The process indicated is carried out by assigning to each observable 
an operator, e.g , for the component momentum in the x direction p s , 

h d 

the differential operator — — is chosen, where h is Planck’s constant, 
2 rrt dx 

and for the energy, the Hamiltonian operator, which is the Hamilton- 
ian function with each momentum component in it replaced by its 
appropriate operator. One of the fundamental assumptions of quan- 
tum mechanics then is that the only possible values of an observable p 
for a particular system are the characteristic values of the equation 

P<t> = pp, (1) 

where the left-hand side consists of the result of operating on the 
<t> function with the operator P characteristic of the observable p and 
the right side is simply the numerical value of the observable multiplied 
into the <t> function. In general it is found that only for certain values 
of p in (1) is it possible to obtain solutions for 4> which satisfy the 
fundamental restrictions mentioned above. These are the possible 
values of the observable and the corresponding <t> functions, usually 
denoted now as p functions, are the corresponding state functions for 
the observable. Thus if \pk is the state function corresponding to the 
value pt of the observable, 

Pfa = Pk'Pk (2) 

is an identity with fa satisfying all the fundamental conditions imposed 
on state functions. It is customary to refer to the values />* as the 
eigenvalues of the observable and the corresponding functions fa as the 
eigenfunctions or eigenstates of the observable . F or atomic problems the 
most important form of equation (1) is that for which the observable 
is the energy. It then becomes the Schrodinger equation 

H<)> = Ep, (3) 

in which H is the Hamiltonian operator and E the numerical value of 
the energy. The eigenfunctions fa are functions of the configurational 
and spin coordinates of the system but they are also characterized by 
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certain parameters, called quantum numbers. These are usually 
represented by n (principal quantum number), l (azimuthal quantum 
number), in (magnetic quantum number) and $ (spin quantum num- 
ber). The eigenvalues E are of course characterized by different values 
of n, l, m, s. 

As a simple illustration the harmonic oscillator may be cited. Here 
the Hamiltonian operator is 

rr = ±(A.±\\k 2 

2m \2irt dx) 2 * ’ 

and eq. (3) becomes, with p in place of <j> to conform to popular usage, 
dty , 8ir 2 m / 1 , 

5 ? +_ 4 5 ~(. £_ 2 te )' 1 ' -0 - (4) 

The eigenvalues of E turn out to be 

E n = (n + %)hv, (5) 

with v — frequency of the oscillator = ~ >/ k/i n. The eigenfunctions 


2tt 


are 


in ~ C n H n (\T x ) e - 


***/2Ar 


( 6 ) 


where 


H n (>£‘) is the so-called Hermite polynomial of order n, 

the most compact representation of which is 

„ d n (e~ e ) 

= f-lV****^ — 


//„({) = (-l)"e f 


dr 


(7) 


The final fundamental postulate of quantum mechanics says that 
when a system is in the state characterized by 4>, the expected average 
of an observable p from a series of measurements is given by 


P = 


f 

f 


<t>*P<t>dT 




( 8 ) 


Here as usual P<t> is to be interpreted as the result of operating on <j> 
with the operator P and it must be recalled that <t> need not be an 
eigenstate of the system for the observable p. The most direct signifi- 
cance of the state function <t> is found in the quantum mechanical 



186 


FUNDAMENTALS OF QUANTUM STATISTICS [Ch.VIII 


theorem that * X/) measures the probability that 

the system in the state <f> will have its coordinates qi • • • qj equal to 
Xi • • • X/ respectively. More strictly <£*(Xj ■ ■ • X/)^(X i • • • X/)dgi • • • 
dqj is the probability that the system will have its coordinates in the 
range dqi • • • dqj in the neighborhood of the values Xi • X/. In order 
that the system shall not be found at this place it is essential that 
vanish there. 

In order to understand quantum statistics it is necessary to consider 
N identical, indistinguishable physical systems forming an assembly in 
the Darwin-Fowler sense. Since the systems are identical the Hamil- 
tonian has the same form for all, though each will be a function of the 
coordinates of the particular system in question. If all the coordinates 
of the jth system including the spin coordinates * are represented for 
simplicity by q } , and if each system is considered isolated from all the 
others, we shall have for the description of the energy behavior of the 
assembly the N Schrodinger equations 

= ■E.V'.(ei). 

H 2 +, (to) = Ejijito)* (9) 

H vtkiqff) = Ek'l'kiqff), 

where the sequences of the eigenvalues E„ E, • • • are really the same 
set in every case. The same is also true of the eigenfunctions f,, • • • , 

except that each is a function of the coordinates of a single system. 

If now we think of the assembly as a single system without however 
contemplating the mutual force interactions between the individual 
constituent systems, i.e., still envisage them as relatively far apart, 
the resultant Schrddinger equation will be 

(Hi + Hi + • + Hs)^(qu • ■ Sat) = Ei(qi, ■■ gjv), (10) 

where 4>(qi, • ■ • qif) is the eigenfunction for the assembly and E the 
corresponding eigenvalue. On examination eq. (10) is seen to be satis- 


fied by 





l#' = ^.( 21 )^ 7 ( 22 ) • 

• • ^(2at). 

(11) 

with 





£ = £, + £, + • 

• • + £*. 

(12) 


This may be interpreted as meaning that ^ is the eigenstate of the 
assembly in which the first system is in eigenstate corresponding to 

* Cf. Lindsay and Margenau, op. ct p. 478. 
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eigenvalue E t , etc., and E as the total energy of the assembly is the sum 
of the eigenvalues of the individual systems. Suppose we interchange 
two systems of the assembly so that system 1 now has energy E } and 
system 2, energy £,. The total energy E is unchanged but \f> becomes 

V = ^»(2a)^;(gi) ■ • • 'I'ki&ff), (13) 

which in general is a different function from in (11). Thus we have a 
different eigenfunction for the assembly corresponding* to the same 
eigenvalue. This corresponds to what is called degeneracy in the state 
of the assembly. The degree of the degeneracy is the number of 
different eigenfunctions with the same eigenvalue. Clearly in the 
present case this number is N !, since there are N ! ways of permuting 
the N sets of coordinates of the individual systems If we denote any 
one of the functions obtained by such a permutation by Vp, it follows 
from the linearity of the equation (10) that the linear combination 


¥ = Sop'J'p 


(14) 


is also an eigenfunction of the assembly corresponding to the energy E 
(assuming that the coefficients ap are so chosen that ¥ is normalized, 4 

1 ■Z-'J' y*ydr = 1 ). 

We now introduce the Pauli exclusion principle which cuts the 
number of possible V functions in (14) to one by means of the following 
postulate : 

If the individual systems are elementary charged particles (in par- 
ticular electrons or protons) the only combinations of the form (14) 
realized in nature are anttsymmetrical with respect to an interchange of 
the coordinates of two systems, i.e., such an interchange produces a 
change in sign without changing the value. Of all possible combina- 
tions of the form (14) there is only one which is antisymmetrical and 
this may be written in the form of the determinant 


MSi^iCSs) • 'PMn) 
MsiWAte) ••• Mw) 


(15) 




where c is a constant. An interchange of two q’s is equivalent to an 


4 Cf Lindsay and Margenau, op. cit , p, 413. 
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interchange of two columns of the determinant and this leads merely 
to a change of sign. It is an interesting consequence of the Pauli 
principle, that if two of the charged particles are in the same state, i.e., 
if * = j, for example, two rows of the determinant become equal and 
the determinant vanishes with the concomitant vanishing of This 
may be interpreted to mean that it is impossible for two such charged 
particles in an assembly to be in identical states. This is often pre- 
sented as thewstatement of Pauli’s principle. It is to be noted that the 
principle allows two elementary charged particles to be characterized 
by the same n, l, and m values provided the spin quantum numbers are 
different. There are only two possible values of the latter and their 
difference is.interpreted physically as an opposition or antiparallelism 
of the direction of spin. It often happens that the numerical value of 
the energy depends very slightly on the spin and in that case the Pauli 
principle allows us to think of two elementary charged particles in 
practically the same energy state, their spins being opposed. 

Suppose in (15) we interchange two pairs of q's, i.e., qi with g 2 and q 2 
with q 3 . Since each interchange involves a change of sign with no 
change in magnitude, the two interchanges will leave the sign unaltered. 
The state function 'P is symmetrical with respect to interchange of two 
pairs of elementary charged particles. If then each individual system 
of the assembly consists of a pair of such charged particles or indeed 
any even number of them, the wave function of the assembly must be 
such that an interchange of two systems leaves it completely unaltered. 
To represent such an assembly of composite systems we need a sym- 
metrical state function in place of the antisymmetrical one in (15). If 
the i^’s in (15) are still interpreted as representing the eigenfunctions 
of these composite systems we can easily get such a symmetrical func- 
tion from (15) by changing all the minus signs in the determinant 
expansion to plus. It can be shown indeed that the symmetrical 
eigenfunction thus obtained is the only possible symmetrical one. An 
illustration of a composite system of this type is provided by the 
deuteron, the nucleus of the hydrogen isotope of mass 2. Many 
neutral atoms are also of similar character. 


2. DISTRIBUTIONS IN QUANTUM STATISTICS 

We are now ready to apply the quantum mechanical ideas of the 
preceding section to statistics and statistical distributions. We have 
just seen that the state of an assembly of elementary charged particles 
is given by the one antisymmetrical linear combination of eigenfunc- 
tions for the individual systems corresponding to the particles. How 
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does this affect the statistical weight attached to the assembly, i.e., (1) 
of Chapter VII? For convenience we rewrite this here, viz., 


W = 


N±lM 

iV 0 ! JVi ! ... N r \ 


(16) 


But this is just the total number of independent state functions which 
can be formed by taking the product of all the functions for the 
individual systems with all possible permutations of coordinates. To 
see this, note that a product of the kind in question is like (11), i.e., 

• • • 'Pkig.N)- 

There are N\ possible products of this kind obtained by permuting 
the qi — qv in the individual functions. If the individual energy 
states, however, are themselves degenerate and the degree of degen- 
eracy of the xth state is g„ etc , so that there are g, independent state 
functions corresponding to the ith energy value, the number N\ must 
be multiplied by the g’s where, moreover, each g is raised to the power 
of the number of systems in the corresponding state. This alone would 
give us for the number of state functions required 

Nltf'g?' •••**' (17) 


With No systems in the zeroth state, Ni in the first state, etc., a 
product like (11) becomes, for example, 

fo(si) i^oGza) • ’ • {qN a +i)ii (av 0 +a) • • • ii iqN x +N t ) 

Now the No 1 permutations of the q it g 2 , • • • qx 0 among the first N 0 
factors do not produce new and different states. As a result (1 7) must 
be divided by 

N 0 \Nil • • • N r \ • " 


in order to get the actual number of independent state functions repre- 
senting the state of the assembly for a given value, E, of the energy. If 
it were not for the Pauli exclusion principle we should expect to use 
the statistical weight (16) in our statistical calculations for quantized 
systems. However, the principle insists that actually the numbers 
No, Ni • ■ • N r • • • cannot exceed unity for any assembly realized in 
nature, and that for any set of values satisfying this criterion W — 1; 
otherwise W must vanish. It is clear then , that the exclusion principle 
forces us to abandon (16) as an expression for the statistical weight. 
The new situation may be expressed in the following way. 
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For an assembly represented by an antisymmetrical wave function 
the statistical weight W has the following values 

W = 1 for N r = 0, 1 (any r), 

W — 0 for all other values of N r . (18) 


With this we can now proceed to derive the corresponding distribution 
law. The average number of systems in the rth state is (cf. eq. (4) of 
Chapter Vli) 


N r 


2N r W 
2 W ' 


(19) 


We shall first calculate the denominator which we recall is the sum of 
all weights of the assembly subject only to the conditions (2) and (3) 
of Chapter VII. We can avoid the analytical complexity introduced 
by these conditions by noting that SIT may be expressed as the coeffi- 
cient of ic^z® in the expansion 

M = 23 Wx 1N ’z SN ’ B ’, (20) 

Nt.Ni V r ... 

where in the summation the N’s may be any positive integers and are 
no longer restricted by the conditions mentioned. Now M may be 
rewritten 

M = E n Wx?’ z*’®'. (21) 

Nt. Ni y T ■ i 

In the evaluation of the product, if in any factor N, is different from 0 
or 1 the whole expression vanishes since W = 0 unless N, is equal to 0 
or 1. Consequently (21) is the sum of all products of the form 

i 

where Nf = 0 or 1. This sum itself, however, is most simply written as 
a product, namely 

M = (1 + *Z®°)(1 + xz® 1 )( 1 + *8®*) • • • 

- U (! + *2*0- (22) 

J 

Thus the extreme terms in the sum represent respectively that in 
which every N, = 0 and that in which every N } = 1 . All other possible 
combinations are represented in the intermediate terms. 
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In Sec. 2, Chapter VII we showed that the coefficient of z* in the 
expansion 

Z N'.gg'g ? 1 ••• g? f 
N 0 \Ni ■■■ N r l 


gZNjBj m Z N 


L f z N * 

iriJc z* +1 dZ ' 


2ri 

Hence by the use of the same reasoning the coefficient of » e^z* in the 
expansion (20) for M will be given by 

M dx dz 


(sJXC 


* W+1 * I+I ’ 


(23) 


where C and C’ are closed contours about the origin in the complex 
plane and both x and z are considered to be complex variables. 

We next get the expression for the numerator in N r . Let us cal- 


1 dM 
log z dE r 


N r n 


■ E 

NtNi-'-Nr- 

But hN r W is just the coefficient of in the expansion on the right 

1 S h/T 

of (24) and therefore the coefficient of the same power in : — ■ 

v log z dE r 

Consequently 

/ 1 V C C 1 dAf dx dz 

rW " W J c Jc log Z ' dEr X N+l P+ 1 ’ (25) 


(24) 


„ r . 1 dM. , . .. , 

We can put the term ; • into somewhat more suitable form 


by writing 


log Z dEr 




(26) 


whence 


1 dM MxtF' ,, d , „ , 


The method of steepest descents may now be applied to the integral in 
(25) by treating the integrations with respect to z and x separately. 
Let the saddle point along the real axis for x correspond to * * = n and 

* The reader should be careful not to confuse the it of this chapter with that of 
Chapter IV. 
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the saddle point along the real axis for z correspond to z = f . Then 
since the function in the integrand can be shown to satisfy the require- 
ments of the method we can write at once 

* i loe (1 + - Tuh*- (28) 

This is the distribution formula for an assembly of elementary charged 
particles, ff we identify f with e~ 1/kT , as in the classical case of 
Chapter VII, (28) becomes 

e -B r /kT j 

~ 1/m + e~ Xr/kT ~ i/n-e Br/kT + 1* (29) 

Assemblies with this distribution law are said to obey the Fermi- 
Dvrac statistics. This should then apply to an assembly of electrons. 
We note at once the difference from the classical statistical distribution 
law (31) of Chapter VII. The quantity m appears as a new statistical 
parameter in addition to f. Its significance will be discussed shortly. 

Before investigating the application of (29) we ought to notice that 
there is another possible quantum statistical distribution law. This 
will hold for an assembly whose eigenfunction is symmetrical. Here 
the weight to be attached to the assembly is also equal to unity for 
there is still only one symmetrical state function associated with the 
assembly, but now all values of N r are possible since the symmetrical 
function does not vanish no matter how many individual systems are in 
the same state. Thus we write in place of (18) 

W= 1, all N r . (30) 

In eq. (21) we must therefore now remove the restriction that N } can 
have only the values 0 or 1 in order to avoid a vanishing product. Thus 
M can now be written as the sum of all products of the form 

IJ W’ E >, 

J 

in which N, may take on any values; but this itself may be expressed in 
the form of a product of sums, viz., 

M = (1 + *z*° + + • • -)(1 + xz Bi + 

=ni>v*. 

i ITo 


( 31 ) 
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Consequently since x and z are both less than unity in absolute value 

(32) 


- - n(r^) 


The evaluation of N r proceeds precisely as in the preceding case with 
1 dM Mxz* ' 


log z dE r 
The result is 

N r = 


1 - 


Mf*' 


— log(l — X^ T ) 
dx • 

(33) 

. - f £r ) 

(34) 


This is the distribution law for what is usually termed the Bose- 
Einstein statistics. With the usual substitution for f it becomes 


N T = 


1 


l/n m <F r/kT - 1 


(35) 


The only mathematical difference between (35) and (29) is the sign 
before the 1 in the denominator. This leads to a very fundamental 
difference in the physical meaning, however, as will appear shortly. 

It is instructive to observe that the expression for N r in the classical 
case of Sec. 2, Chapter VII (eq. 24 or 31) can also be obtained directly 
by the method of this section. For we can write 


M 


- E 


n 




N I: 


in the form 


W0.JV1 


n w 

k 

M = N ! S TT x N > 

N } 1 


X ZN, Z ZN,B, 


(36) 


Furthermore the sum of all the products indicated in (36) may be 
expressed as the product of sums, viz. 


m ■ n sf! * n ^ = iv ’ ! n 

As before we form 


(37) 


) n-0 


1 = M gr XZ*', * 


and finally get 


log z dE r 


N r = H&rS* r = Hg*~* r/kT - 


(38) 
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This is in the form of the distribution formula (24) in Chapter VII if we 


Z(S) 


(39) 


This helps to make clear the physical significance of the distribution 
parameter u in the general quantum statistical case. It must be so 
chosen that, 2 N r = N, the total number of systems in the assembly. 
This will serve to fix it in terms of N and f. The evaluation of u will 
be carried out in the following sections. Independent justification of 
the association of f with temperature in the quantum statistical case 
will also be given. 


3. ALTERNATIVE TREATMENT OF QUANTUM STATISTICAL 
DISTRIBUTIONS 

Before proceeding to apply the distribution formulas (29) and (35) 
for the Fermi-Dirac and Bose-Einstein quantum statistics to the 
properties of gases and solids it will be worth while to present an alter- 
native method of derivation. We shall here revert to the method of 
Chapter IV and shall not hesitate to use Stirling’s formula. Moreover 
we shall derive the distribution laws for all three types of statistics. 

We wish to distribute an assembly of N objects, e.g., material 
particles, into energy states in such a way that in the states between 
Ej and E, + dE } there are N } particles, etc. To make the discussion 
more pictorial we visualize the energy interval as a region in phase 
space, a kind of energy shell containing all values from E } to E t + dE r 
In classical statistics this shell is conceived to contain a continuous 
r,ange of energy values but in quantum statistics it is necessary to give 
it a structure and to suppose that it consists of cells associated with 
each one of which there is a definite possible state of a particle of the 
assembly. Let us suppose there are n, of these cells in the jtb. energy 
shell. We shall shortly derive an expression for the dependence of nj 
on Ej and dEj. 

We proceed first to distribute the particles in accordance with a 
suggestion of Brillouin.* Each cell is assigned a capacity dependent on 
the number of particles in it. In particular for a cell with p occupants 
the capacity is assumed to be 1 — pa, where a is a real parameter which 
may assume different values, to be discussed later. The weight 
attributed to the jth shell is the number of ways of assigning N } par- 
ticles to the n } cells in the shell subject to the above capacity limitation. 

• L Brillouin, "Lea Statistiques Quantiques,” Vol 1, pp 167 ff., Paris, 1930. 
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The first particle may be placed in n } ways, but the second in only 
% — a ways, since the cell already occupied has now a capacity of only 
l — a. The number of ways of assigning all N } particles will be denoted 
by Wj, where 

Wj = nj(n 3 - a)(n } -2 a) (», - (N, - l)a) = (40) 


It is, of course, assumed that n } — (N } — l)o > 0 and ttoat algebraic 
meaning can be associated with ( n 3 /a ) 1 etc. The weight corresponding 
to a distribution of the whole assembly in which there are No particles 
in the energy shell about Eq, Ni in the energy shell about jEj, etc. 
(with 2 Nj — N, and 2 N 3 E } — E = constant) is then the number of 
ways of assignment of the N particles to the shells multiplied by the 
number of arrangements among the cells in every shell. The total 
weight then becomes 

N\ tt a N ’(n,/a)l 

11 — ( 4l ) 


w = 


II".' 


(»,/*- A.,)! 


We now follow the procedure of Sec. 2 of Chapter IV and inquire 
for the distribution corresponding to maximum w subject to the usual 
conditions on the total number of particles and the total energy. Using 
Stirling’s formula and making some reductions, we have 


log w = N log N — N — 2 JV ; log N } + 1/o-S n } log - 

ft j **“ dJS j 

+ S N } log {n } — aNj). (42) 
We now set 3 log w = 0 subject to the conditions mentioned and get 

S 5 N, log = 0 (43) 

subject to 

2&N } = S E,6N, = 0. (44) 


Introducing the undetermined multipliers -yi and 72 and proceeding 
precisely as in Sec. 2 of Chapter IV we are finally led to 

^ = - _ Tl (45) 

nj a + e 

Nj/rij is the average number of particles per cell in the jth shell for the 
distribution corresponding to maximum w, i.e., that which we termed 
the canonical distribution in classical Maxwell-Boltzmann statistics. 
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Coupled with a knowledge of n } , eq. (45) constitutes the quantum 
statistical distribution formula analogous to the classical formula (25) 
of Chapter IV. If we set 


72 =~ 


1 

kT 


(46) 


the distribution formula becomes 




N,/n, = 


1 

o + e-VV‘ r ' 


(47) 


We naturally seek a connection between (47) and the distribution laws 
(29), (35) and (38) already derived in the previous section by the 
Darwin-Fowler method. Let us first look at the matter from the purely 
formal mathematical standpoint. If a — 0, (47) becomes 

Nj = n,c"r E > /hT . (48) 

Here N } has precisely the same dependence on E } as N r on E r in eq. 
(38). It evidently is the classical distribution law. The two formulas 
become identical if we identify ft g, in (38) with e yt n } in (48). What 
physical significance can we attach to the choice a = 0? It clearly 
corresponds to a constant capacity of unity for every cell independent 
of the number of particles in it. But this is just the classical assumption 
that every cell has the same a priori probability. The weight g, is 
then simply the number of cells available with energy Ej and corre- 
sponds to n } . The parameter n will finally be associated with e\ 

In looking for a connection between ( 47 ) and the quantum sta- 
tistical formulas (29) and (35) we are naturally led to try the assump- 
tions a — + 1 and a ~ — L The form of N,/n, in (47) with a — 1 then 
looks much like that of N r in (29) and there is a similar resemblance 
between (47) for a — — 1 and (35), particularly if we agree to let e* 1 
stand for ft. Unfortunately we should naturally wish to associate N, 
in the one case with N r in the other, whereas the appearance of n } in 
(47) appears as a sort of stumbling block. This difficulty is cleared up 
when we reflect that the N r and the N } do not after all refer to the 
same thing. The rth energy state in the Darwin-Fowler method of 
Sec. 2 is a genuine microscopic energy level to which the specific 
energy value E r is assigned. In the Fermi-Dirac statistics only zero or 
one particle may exist in this state. On the other hand in the alterna- 
tive treatment of the present section Nj is the number of particles in a 
whole range or shell of energy values ranging from E, to Ej ■+■ dE } and 
nj represents the number of possible or allowed energy states' in this 
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range. Consequently the E r values in (29) refer strictly to the energies 
associated with the various cells in a particular shell in the Brillouin 
method, while the E } value in (47) refers to the average energy over a 
whole shell of states. If we like we may think of (29) as defining a 
microscopic distribution and (47) as defining a macroscopic distribu- 
tion. That the distinction does not appear to be necessary in the case 
of classical statistics where, as we have just seen in the preceding para- 
graph, we are able at once to identify formula (48) with (38), is not so 
surprising when we consider that in classical statistics there is no real 
necessity for subdividing energy shells into discrete cells; there is 
indeed no prescribed lower limit to the size of the energy interval and 
no reason why we should not identify E r in the one case with E, in the 
other. A similar statement holds for the identification of g, with n,. 
However we are naturally more interested in the quantum statistical 
case. With the distinction between the two alternative points of view 
held clearly in mind while we realize that there is no essential incon- 
sistency involved, we can proceed to use whichever form of distribution 
formula seems more convenient. For the present we shall continue 
our discussion on the basis of eq. (47). 

It is desirable, however, to pay a little attention to the physical 
significance which can be attached to the choices a — + 1 and — 1 in 
the Brillouin point of view. The assignment a = 1 implies that the 
capacity of a cell is unity when no particle is in it but drops to zero as 
soon as one particle enters. Consequently on this choice a cell may 
hold at most one particle ; there are only two possibilities, one or zero. 
The connection between this and the Pauli exclusion principle, which 
led to the Fermi-Dirac statistics from quantum mechanics, is clear. If 
on the other hand a = — 1 , the capacity of a cell with p occupants is 
1 + P, i e . the capacity of the cell increases with the number of occu- 
pants. The connection with the Bose-Einstein statistical assumption 
expressed in (30) is not indeed so clear as in the corresponding case of 
o = l and the Fermi-Dirac statistics. Nevertheless there appears to 
be no essential inconsistency between the two points of view. More- 
over we are not restricted to the values of a = + 1 and —1. For 
example, 0 < a < 1 would imply a loosening of the Fermi restriction 
though at the same time allowing less freedom of occupancy of cells 
than the classical theory. Certain analytical difficulties with frac- 
tional a would indeed appear to arise from the fact that w, in (40) must 
be integral. Closer inspection shows that these can be overcome, 
though we shall not pursue this possibility here. 7 

»Cf. R. B. Lindsay, Phtl. Mag [7] 17, 264 (1934), also D S Kothari, PhU. 
Mag. 18 , 192 ( 1934 ). 
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The next step in the exploitation of the distribution formula (47) 
is to establish the connection between the parameter 73 and the tem- 
perature in a straightforward fashion. We shall use the elementary 
statistical-thermodynamical analogy (cf. eq. 68, Chapter IV) 

dE 

kd log u» = y , (49) 

where dE is the change in internal energy due to flow of energy, e.g., 
heat (not change in internal parameters). Since (49) holds for equi- 
librium states only we must use the maximum log w, i.e., that corre- 
sponding to the distribution law (45). The change in log w associated 
with changes dN, in the number of particles in the jth shell due to the 
inflow of heat is 

d log w = ZiN, log (SO) 

_ , n, — aN, _ TT 

But log — = — 7i — 72 ^ 3 . Hence 

d log w = — S(ti + y i E,)dN i = — 7 2 S E,dN } = — 7 2 dE. (51) 

We note that though the number of particles does not change when 
heat is added, the total energy changes by '2,E J dN J since the effect of 
the energy flow is to alter the number of particles in the higher energy 
shells. Hence the analogy (49) leads to 

1 


in agreement with our assumption (46). 

We shall conclude this section by a brief reference to the more con- 
ventional method of discussing the Fermi-Dirac and Bose-Einstein dis- 
tribution formulas in order to point out certain differences with the 
Brillouin method just described. 

Let us first consider the Fermi-Dirac case. The problem to be 
solved is still the distribution of N identical objects in energy shells 
with Nj in the jth shell and the number of available cells in the j'th 
shell being equal to n,. It is assumed that no more than one object 
can be put into any cell. The number of ways in which Nj objects 
can be placed in n } cells so that no cell contains more than one object 
is equal to the number of ways in which the n } cells can be divided into 
two groups with Nj in one group (those which contain one object) and 
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— N } in the other group (those which contain no object). From 
Chapter II this number is 

JVC*# - Nt ) ! 


Consequently the total number of ways of distributing all N objects is 



«,!• 


#>!(», - N ,) ! 


(41') 


On this view this should be the statistical probability or weight associ- 
ated with the distribution. It will be noted at once that it is not 
exactly the value (41) obtained above for a = 1. Rather, it is equal to 
w/NK 

We proceed next to the Bose-Einstein distribution by a similar 
direct method. We now wish to distribute N } objects among n } energy 
cells in such a way that no restriction is placed on the number of 
objects in any cell. This is clearly the number of combinations of n, 
things Nj at a time with all possible repetitions allowed. From the 
algebraic theory of combinations, this number is equal to the number 
of combinations of n, + N, — 1 objects N, at a time without repetitions. 
Consequently the required total statistical probability or weight 
becomes 


n (Nj + n, — 1) ! 


(41") 


Now if in eq. (40) we let a = — 1, which should correspond to the Bose- 
Einstein distribution in accordance with our previous assumptions, 
the value of w becomes 


w 



(N, + n, - 1) ! 

“ D! ’ 


which again is just Nl times the value (41") obtained by direct count- 
ing. 

The question arises: What is to be done about the factor Nl which 
occurs in the Brillouin method of computing the statistical weights but 
is absent in the direct counting scheme? It will be recalled that this 
same factor occurs in the expression for w in the classical Maxwell- 
Boltzmann statistics in Chapter IV. There we found its presence of 
no particular moment as far as the distribution function is concerned. 
It proved indeed embarrassing in connection with the statistical 
definition and evaluation of the entropy and free energy; we found 
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it necessary to use vi/N ! as an "effective statistical probability" in the 
definition of entropy. If we use the Brillouin expression (41) for w the 
same situation will prevail in quantum statistics. We could, of course, 
redefine the statistical probability to bring it into agreement with 
(41') and (41") and this will be the more logical course if we consider 
the quantum statistical distributions to be the fundamental ones. 8 
However we shall find it simpler to continue to use the Brillouin w and 
define entropy in terms of w/Nl. All applications will then be per- 
fectly consistent. 

4. EVALUATION OF tij FOR AN ASSEMBLY OF FREE PARTICLES 

Before we can apply formula (47) to concrete cases we must have 
n f or the number of cells in the jt h energy shell. This is the same as the 
number of possible energy values lying in the interval E } to E, + dE } . 
Its value clearly depends on the nature of the assembly. The method 
of evaluation will be quantum mechanical which is reasonable since 
we are talking about quantum statistics. We shall confine our atten- 
tion to an assembly of free particles all having the same mass, the 
statistical analogue of an ideal gas. The problem is to determine the 
allowed energy values of the assembly. Since the particles are assumed 
to exert no forces on each other, these values can be readily calculated 
by direct summation from the energy eigenvalues for a single particle 
confined to a closed vessel. Suppose for convenience the vessel is a 
rectangular parallelepiped with dimensions li,h,h. The Hamiltonian 
for the particle is (j% + p 2 + p 2 ,)/2m since, as the potential energy is 
constant it may conveniently be taken equal to zero. The correspond- 
ing quantum mechanical operator becomes (cf. Sec. 1) 

_ _*!_ (]P. . £_ . *!_ vs 

Sir 2 m Vdx 2 dy 2 dz 2 ) 8ir 2 tw 

The appropriate Schrodinger equation then becomes 

V 2 * + ^?Et = 0. (52) 

This may be solved by separation of the variables, i.e., by writing 
iK*. y, z) in the form of the product of three functions i x {x), 'pyiy), ^*(z) 
depending respectively on x, y, and z alone. Thus 

p = tx'Py'I'f (S3) 

•This, for example, is the view of Mayer and Mayer, ‘‘Statistical Mechanics,” 
pp. Ill ff. For another discussion of the {froblem see Brillouin, op. cit., Vol. 1, 
pp. 171 ff. 
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Equation (52) becomes consequently 

1 d\ X 1 jVy 1 

ix dx 2 ypv dy 2 dz? 


(54) 


where 8x 2 mE/h 2 has been replaced by k 2 . Let us write k 2 = <tf + x§ + *§• 
Then we merely have to solve the individual ordinary differential 
equations 


d 2 ** 2 , _ n . d 2 <p v 

+ kiV'x “ 0; 


^2 +4*s-0; ^-‘ + *1^ = 0. (55) 


The solutions are 

** = Axe'** + B x e~'‘ ,x 

tv = + V™ (56) 

t, = 


These solutions are subject to the boundary conditions 

tx = Oat* = 0, /i, 

tv = 0 at y - 0, / 2 , (57) 

tt = 0 at z = 0, / 3 , 

if we suppose that the container is placed with one corner at the origin 
and has the coordinate planes for three of its faces. These boundary 
conditions express the fact that the particle is confined to the vessel, 
i e. t there is no probability of its ever being found at or outside the 
walls. The conditions (57) lead at once to 

sin Kill = 0; sin k 2 Iz = 0; sin k 3 1 3 = 0, 


or expressed more explicitly 




Wlir 



«2ir 



» 3 ir 



(58) 


where »i, « 2 , and « 3 are any integers, positive or negative but not zero. 
The energy eigenvalues of the free particle in the vessel therefore are 

wi , «f , 

8 m\« + + HI 


"(ni, nj, ni) ~ 


(59) 


Corresponding to every set of integers n 1( n 2 , n 3 , eq (59) gives a 
possible energy value. The corresponding eigenfunction is 


_ . tiiirx . n 2 try . » 3 irs 

t = C sm — : — • sin — — • sm — — , 

n h h 


(60) 


where C is a constant determined from the normalization condition 
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expressing the fact that the probability of finding the particle some- 
where in the vessel is unity. This takes the form 


/'ll /'ll /’ll 

/ / / |^| 2 dxdy dz 

Jft %/n %/ft 


(61) 


Application of this yields _ 

C = = 2^, (62) 

v hhh t 

where r is the volume of the vessel. Inspection of (59) now discloses 
that because of the small value of h 2 /8m in absolute units, if r is large, 
i.e. the vessel of macroscopic size, 1 cm 3 or larger, the energy eigen- 
values are very dose together. In fact they may for practical pur- 



poses be considered effectively continuous.® In any case our problem 
is to determine from (59) the number of values of lying in 

the interval E, to E, dE } . For this purpose we set up a three- 
dimensional rectangular lattice (Fig 8 1) in which the sides of the unit 
cell in the three coordinate directions are l//j, I//2. and I//3 respec- 
tively. Every point in the lattice is given by the three integers n lt «2, n a 
and the distance of the point from the origin is 

»„«.) = (63) 

With each point in the lattice there is assoriated an allowed energy 
eigenvalue £<«„ by (59). Since change in sign of m, n a , «3, or 
any one of them, does not produce a new energy value nor a new 
independent state function we limit the lattice points under considera- 

' Cf. Lindsay and Margenau, op. cit., pp. 428 ff, for a discussion of this subject 
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tion to the first octant of the lattice space, i.e., that for which n 1( « 2 , n 3 
are positive integers. The number of eigenvalues lying in the energy 
interval cited is then equal to the number of lattice points lying between 
the octant of the sphere of radius r, corresponding to Ey and the octant 
of the sphere of radius fy + dry corresponding to £y + dE } . Now each 
lattice point can also be associated with a lattice cell like that shown in 
Fig. 8-1. Consequently the number of lattice points in the spherical 
shell octant just mentioned will also be equal to the number of lattice 
cells contained in it and this in turn will be given by the volume of the 
shell octant divided by the volume of the unit cell, namely 1 /hhh = 
1/t. Obviously the approximation involved in the last statement 
improves as r becomes larger. In insisting that r be macroscopic in 
size we are insuring a very good approximation. The volume of the 
octant in question in the lattice space is 


But from (59) and (63) 




r, 


= - V2mE } , 
h 




(64) 


and therefore the number of energy values required, whjch is just the 
n } value we have been looking for, is 

«, = |r (2 m)»E* dE,. (65) 


The quantum statistical distribution formula (47) then takes the 
form 


N, = 


iirmr/h 3 • V 2mEj dE 3 
a + e~ yi e E,/kT 


( 66 ) 


Once more we emphasize that in this formula a = + 1 corresponds to 
the Fermi-Dirac statistics and a = — 1 to the Bose-Einstein statistics. 
For particles with a spin quantum number Yi in which the energy is 
practically the same for both spin directions (cf. the remarks near the 
end of Sec. 1) it is necessary to multiply the right side of (66) by 2 to 
get the correct value of N t . For in this case we can effectively have 
two particles (with opposite spins) for each numerical value of the 
energy. This will be true with free electrons. For molecules without 
a spin, (66) should apply as it stands. On the other hand w for mole- 
cules with nuclear spin quantum number s, N, must be multiplied by 
2s + 1. 

w Cf. Mayer and Mayer, op. at., pp. 135 f. 
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S. QUANTUM STATISTICS OF A WEAKLY DEGENERATE GAS 


We have still the task of evaluating the statistical parameter 
in the distribution expression (66). This can be done by utilizing the 
condition SiVy = N, the total number of particles in the aggregate. 
In this connection considerable interest attaches in the first place to 
the special case a » 0. Then (66) should reduce effectively to the 
MaxwellianVlistribution of classical statistics. From the way we have 
expressed n, we must write 2 N, as an integral and have 11 


VE e~ s/kT dE = N. 


(67) 


kT 


The integral in (67) is a well-known one and is equal to — y/w kT. 

mt 

Hence the parameter y\ is given by 



1 

(2ir mkT)*' 


( 68 ) 


Substitution into N } , now rewritten as dN for greater consistency, 
gives 


dN 


AirmN 
(2, rrnkT)* 


y/2mEe~ E/tT dE 


(69) 


for the number of particles in the assembly having energies lying in the 
interval from E to E + dE. This can be readily transformed to give 
the distribution in terms of velocity by writing E = mv 2 /l. The result 
is 

dN = 4 tN {^ff v2e ~ mt/UTdv > (70) 


which, with allowances for difference in notation, is identical with the 
Maxwellian distribution in eq. (S3) of Chapter V. This at any rate 
indicates that the general distribution formula (47) includes the 
classical distribution as a special case, viz., that for a — 0. 

We must now consider the general case where — 1 ^ a ^ + 1 , with 
a * + 1 (Fermi-Dirac) and a - — 1 (Bose-Einstein) as the interesting 

11 This is really a matter of convenience justified by the fact already emphasized 
that the differences between successive eigenvalues are very small compared with 
the eigenvalues themselves. To handle the matter by means of a summation would 
involve using (59) directly, leading to a rather difficult problem in algebra. 
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limiting cases. The evaluation of e yi proceeds as before, the condition 
2JV) = N now taking the form 


2 ht 



y/EdE 
a + e ~y> e WT 


= N. 


(71) 


There is a certain convenience in introducing the transformation 
E/kT = «. Then (71) becomes 


2ht 


(2mkT)» - 

Jo e 



- 71 e u + a 


= N. 


(72) 


We shall set —71 = 7 and confine our attention for the rest of this 
section to the case in which 7 > 1 but is not sufficiently large for us 
to be justified in neglecting a compared with e y e u . Now we can trans- 
form the integral in (72) in the following fashion, letting a - ± b, 
where b is always positive, though a may be positive or negative. 
For the Fermi-Dirac and Bose-Einstein statistics 6 = 1 . For other 
intermediate brands of statistics 0 < 6 < 1. We shall carry through 
the general case. The integral in (72) then becomes 



y/udu 
e V + a 


1 r m y/udu 

b Jo ± 1 + 


Call e y ~ l0 ‘ b = e. 
integral 


The foregoing integral is a special form of the 


with a > 1 and p rational and positive. Since we can expand the 
denominator of the integrand in the form 


1 

e“ + “(l ± «- (a+tt) ) 


= e“ (a+u, [l =F <r (a+u) + e ~ 2(a+u} • • ■] 


it follows that 


U(p, a) 



( T 1 y-'e-^+^du. 


Introduce the transformation ju - t and utilize the fact that 



u'e~ ,u du 




r (p + 1) 


,p+i 


(73) 


(74) 
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The result is the convergent expansion 


U(j> 


H «) = (e- 


.—2a 


.-8a 


+ 


2 P+1 ~ 3 P+1 


•) 


r(p + 1). 


(75) 


In the particular case of (72) p = 1/2. Recalling that F(3/2) = y/r/2, 
we finally have 


r (2 t rmkT)*( 
~ h 3 l V 



+ ••• + (=Fl)" 


e — (n+1 )a 

(« + I)” 





From this the first approximation to a is obtained in the form 

-a _ ,l3Nb 

6 ~ r(2ir mkT) H ' 


(77) 


It is interesting to observe that if we set b — 1, this is just the expres- 
sion for e T1 in the classical statistics distribution formula. The second 
approximation to e~“ is 


h 3 Nb T h 3 Nb 1 

TQrmkTpl 1 ± 2 H T{2ivmkT)’ ii \ 


(78) 


This approximation introduces a distinction between positive and 
negative a. In (78) the plus sign corresponds to positive a, while the 
minus sign corresponds to negative o. In the limiting cases in which 
a = ± 1 we have 6 = 1. The value of e yi for the Fermi-Dirac and 
Bose-Einstein statistics respectively is given to the second approxima- 
tion (recalling that now e n — e~°) by 


£ T1 Fernu-Dirfto 


Tl 

® BoBe-Einatein 


h 3 N r h 3 N ] 

t(2tt mkTj* L 1 + 2 H T(2rmkT) ii y 

h 3 N f _ h 3 N 1 

r(2rmkT ) w L 1 2 H T(2irmkT) H j ' 


(79) 

(80) 


For 1 cm 3 of hydrogen at room temperature, the expression 
h 3 N/r(2imikT ) 3/ 2 has a numerical value of the order of 10~ 4 . Hence 
a is considerably greater than unity and the approximation in (78) 
is a very good one in this case. For reasons which will appear more 
clearly in the next section a gas for which this is true is said to be 
weakly degenerate. For the present we shall use the term degeneracy 
as an indication of the deviation of the properties of a gas treated 
quantum statistically from those of an ideal gas treated by classical 
statistics. The measure of the degeneracy is the value of e yi . For 
hydrogen under the conditions just mentioned the degeneracy is indeed 



Sec. 5] QUANTUM STATISTICS OF A WEAKLY DEGENERATE GAS 207 


so weak that e n reduces for all practical purposes to the value for an 
ideal gas in classical statistics. As the temperature is lowered, e yi 
increases and the degeneracy becomes greater. However for hydrogen 
even at its critical point e n is of the order of only 10 -2 . 

The distinction between quantum and classical statistics will be 
further brought out by a calculation of the total average energy of the 
gas as a function of the temperature. This is given by 


2 rr /•* 

E = XNjEj = -rj- • kT- (2 mkT) m / - 
h Jo * 


u i/2 du 
+ e~ yi e u 


(81) 


We again make use of the integral U(p, a) in eq. (75) with p — 3/2, 
and can finally write 


3r kT(2rmkT) m 
E = 2h 3 - 




+ * • • + (=F l) n 


c -(n + l)a 

(n + l) s/2 


+ 



(82) 


where the upper sign refers to positive a and the lower sign to negative 
a. Combining this with the expression for N in (76) gives 


E - % NkT 




nS/2 


+ • • • + (=F 1)" 


g— (n + l)a 

\n + 1) 5/2 


...) 


r 2 " 

o3/2 


+ ••• +(=Fir 


(n + lja 


(« + l) 8 ' 2 


+ 


-) 


(83) 


If e~° is sufficiently small we can approximate this successfully by 
£ = + ], (84) 


where the upper sign now refers to positive a and the lower sign to 
negative o. If we set for brevity 


jr h 3 N 

r(2*mkT) 3/2 ’ 


(85) 


(78) with b 


1 becomes 



( 86 ) 


We can then substitute into (84) and obtain for the approximate 
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expressions of the energy on the Fermi-Dirac and Bose-Einstein 
statistics respectively 


[ K 1 

1 + ^s75 “ ^7a + ' ’ ’ J ( 8 ?) 

£b-e = *NkT [l - fn ~ ^ ‘ ' •]■ < 88 > 


These may tfe considered as correct approximations as far as terms of 
order K 2 are concerned. It is plain that for a weakly degenerate gas 
the energy differs but slightly from the classical value. 

We now wish to obtain the equation of state of a weakly degenerate 
gas. This involves getting the entropy and free energy. For the 
entropy we have agreed (Sec. 3) to use the definition 

S = *log^. (89) 

By inserting the value of N } /n } given in eq. (45) into the expression 
for log w in eq. (42), we are led to 

S = k [. E/kT “ yiN + n > loB (1 + ae 7, e _£,/iT )]. (90) 

From (65), replacing the summation by an equivalent integration, we 
have 

-YVlog(l + ae -"e- B ’ /kT ) 
a ’ 

« ~ (2*0*1 y/E log (1 + ae n e~ B/kT ) dE. 


Partial integration of the integral leads finally to 


S — ky 1 N + f.f 

The free energy is 

¥ = E - TS = ky 1 NT - %E. 


The equation of state then becomes (eq. 12, Chapter III) 


P 



dyi 2 dE 

- NkT ~i + -3Tr- 


(91) 

(92) 


(93) 


From (78) with b = 1, we can evaluate yi (recalling that e~ a = e 71 ) 
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and from (84) we can get — , being careful to keep terms of appro- 

or 

priate order in each case. The final result is 



(94) 


It is worth noting that this result, which we have already derived in 
the classical kinetic theory (eq. 12 of Chapter V) holds fo* both brands 
of quantum statistics. It is therefore a very general formula. 

The equation of state for a weakly degenerate gas in the two brands 
of statistics then appears in the approximate form 

( K 2 K 2 \ 

1 + ^ — + •••)■•• Fermi- Dirac (95) 


( K 2 K 2 \ 

1 — ^ + • • • ) • • • Bose-Einstein 


As a consequence of these equations, a weakly degenerate Bose- 
Einstein gas is more compressible than an ideal gas, while a weakly 
degenerate Fermi-Dirac gas is less compressible than an ideal gas. 
Actually the differences are so slight that they are entirely masked in 
real gases by the departures from ideality owing to the forces between 
the particles. These are of course neglected in eqs. (95) and (96). It 
is only at very low temperatures that the terms in K and K 2 become 
appreciable and here the intermolecular forces become particularly 
significant. 


6. WEAKLY DEGENERATE GAS BY THE DARWIN-FOWLER METHOD 

It will be worth while to consider how the problem of Sec. 5 can be 
attacked from the standpoint of Sec 2. For this purpose we need 
expressions for iYand E analogous to eqs. (71) and (81) in Sec. 5. We 
also need an expression for the entropy. 

We proceed by analogy with the development in Chapter VII. 
There the sum of all the weights, 2 W, was shown to be equal to the 
contour integral 



where Z = 2 is the partition function. The expression for the 
energy in terms of the parameter f (later associated with the tempera- 
ture by f = e~ 1/kT ) was obtained by expressing the fact that Z/iP IN 
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has a minimum at z — f on the real axis. This led to eq. (20) of 
Chapter VII. What is the analogous equation in the case of quantum 
statistics? In eq. (23) of this chapter the sum of weights is given by 
the double contour integral 


2 W = 


4 \ 2 f f Mdxdz 
2iti) Jc 


We expect to find the connection between E and f in this case by 
expressing the fact that (M/s^) 1/N has a minimum at z = f on the real 
axis. This demands that 

dz\z*' N )~° at8 “f- ( 97 > 


The differentiation yields 
E = 


r^DogMUrtj. 

of 


(98) 


If we introduce from eq. (22) the Fermi-Dirac expression for M, which 
now takes the form 

M(/x,f) - JI (1 + (Fermi-Dirac) (99) 

J 

the energy expression (98) can be written 

e f-d = f £ n, log(l + nfi). (100) 

The parameter ft is the value of the complex variable x along the real 
axis where (M/x K ) 1/N has a minimum. The condition for this is 

( M/x N ) 1/N - 0 at x = ,t. (101) 

OX 


When the differentiation is performed the result is 

1 °g( 1 + Mf®0, (102) 

where again we are of course employing the Fermi-Dirac M. The 

reader will observe that M(J) in (99) is not quite the same M 

vised in eq. (22). We have incorporated in it the exponent Wy. the 
number of cells associated with the energy shell Ej. The reason for 
this step should be clear from the discussion in Sec. 3 where we com- 
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pared the Darwin-Fowler distribution formulas with those obtained 
by the alternative method. The Ej which we are using throughout 
our present development refers to a whole shell of values and not a 
single energy state. To it must therefore be attached the weight n,. 

The expressions for E and N in the Bose-Einstein statistics are 
obtained by the same reasoning as (1 00) and (102) . The only difference 
is the function M, which is now ' 

M({) — JJ (Bose-Einstein). (103) 

The results are 

■Eb-e = - r 3; y%, log(l - , (104) 

^b-e “ ” ^ 2^ »> log(l *') . (105) 

Let us go back and evaluate (102), using for this purpose the 
expression for n, already obtained in (65) and replacing, as usual, the 
summation over j by an integration over the energy. We also set 
f = e~ 1/kT . Then (102) becomes 

N r . D = M ^ r~r (2 »)* log(l + v~* /kT ) y/EdE. (106) 

. OH Jq ft 

Differentiation under the integral sign is here allowed and the result is 

*f-d = f? j d07) 

But this is precisely our eq. (71) with a = + 1 with 1 /#» in place of 
e -71 . In similar fashion 

Ef-d - f ^ jT" ^ (2 m)» log(l + y/B dE. 

The differentiation leads to 

£f-d = £ j - 1/m -prsr ( 108 ) 

With the transformation u = E/kT, this is identical with our previous 
eq. (81) when a = + 1. 

The reader may proceed to show that similar results are obtained 
with the Bose-Einstein statistics. It is, clear that the straightforward 
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application of the Darwin-Fowler quantum statistical formulas leads 
to all the results of Secs. 3, 4, and 5. We should, however, examine the 
entropy. In Chapter VII, (eq. 86), we saw that in the Darwin and 
Fowler method the entropy is defined as 

S = JfelogS W - * log 2VL 

This was found in the case of classical statistics to lead to 

5 = kN log Z(f) — kE log r — k log N !. (109) 


What is the corresponding expression in quantum statistics? This 
will presumably be obtained by replacing the partition function Z 
by its quantum statistical analogue. Actually to be rigorous one 
would have to evaluate the multiple contour integral (23) by a general- 
ization of the method of steepest descents used in getting the expres- 
* f Z N 

sion for the similar simple integral 1/2 irt- / -gxy dz of Chapter VII. 

Jc z 


This has been done by Fowler. 15 We shall not repeat it here but merely 
point out that 


k log 


/ 1 V f r M(x, z)dx dz 

W JcJc x N+1 z J5+1 


= k log M (ji, f) — kN log n — kE log f (110) 


to the same approximation which led to (109). The reader who has 
followed through the evaluation of (28) in Chapter VII will see the 
plausibility of (110) though for the rigorous demonstration Fowler 
must be consulted. Incidentally (110) shows that the quantum sta- 
tistical analogue of the partition function Z is 


Z 


qm 


[at( m , r)] 1/Ar 


( 111 ) 


If now we apply (110) to the Fermi-Dirac statistics we get, using (99) 
and ii = e yi 


S = k£n } (1 + e"e~*> /kT ) - kN yi + 1 , (112) 


which is exactly eq. (90) with a = + 1. The reader can proceed to 
derive the corresponding expression for the entropy on the Bose- 
Einstein statistics and again obtain agreement with eq. (90), this 
time with a - — 1, of course. 

11 "Statistical Mechanics,” second edition, pp. 47 ff. 
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7. FERMI-DIRAC STATISTICS OF A STRONGLY DEGENERATE GAS 

We shall now return to the general quantum statistical distribution 
law (47) and consider the case of Fermi-Dirac statistics (a = + 1 ) when 
7 i is positive and e yi so large that for a considerable range of values of 
E „ we have 1 2 > e~ yi e El/kT . In line with the discussion in Sec. 5 we 
shall refer to this as a case of strong degeneracy, since it necessarily 
implies a considerable deviation of N } /n } from the classical distribu- 
tion formula (48). 

As before we evaluate e yi by utilizing the expression for N in terms 
of 7 i and T, i e., eq. (72). For a = 1 this becomes 

- (.13) 


The integral in (113) is a special case of the integral 


U’(j>, 7l) 


“jfr 


u p du 


+ e- yi e u ' 


(114) 


where 71 1 and p is rational and positive. Let us introduce the 

transformation 7 jy = — 71 + « and obtain 

t/'(p, 7i) = 7f +1 ^ (1 + ^ 


-1 


1 +e y 


(115) 


This may further be written 


U'( P ,y t) = 7f +1 



(1 - yYdy 
1 + e~ yiv 


+ 



(1 + yYdy ^ 

1 +e yiv J‘ 


The first integral on the right becomes 


1 f l (1 - yYdy 

(p + 1 ) J 0 1 +e™ ’ 


whence 

d'O.-ri) - rf +1 [^^p T j 


f 1 iX-yYdy r 

X l+e™ X 


(l-Fy)^ 
1 + « T1V 



Now since 7 X » 1, 1 + e yiv is so large that the integration from 1 
to « adds very little to the value of the second integral in the bracket. 
Consequently, we can write to a good approximation 


V{p, 71 ) = 7f +1 


t(p + 1) Jo 


M(i + yr-(i-y)*? 

1 + 6™ 


']• 


dy\- (117) 
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Since y < 1 we can expand the numerator in the integrand by the 
ordinary binomial expansion and obtain for the integral 

<118 > 


The binomial coefficients are written in the form 


( P ) 

\v+i/ 


as in 


eq. (7), Chapter II. The integral in (118) can further be written as a 
double sum by the expansion of 1/(1 + e yiV ), viz., 

2Y'(-1)Y P \ f e~ lk+l)yiv )? 3+1 dy. (119) 

XI \2j+i/Jo 

The next step is to let t = {k ■+■ l)7iy, whence the above integral 
becomes 

_ /•<*+ e~ l P ,+1 dt 

2 § (_1) (*+>)X E<* + D-ri ] 2j+2 ‘ (120) 

This suggests the gamma function, save that the integration limits 
are from 0 to (k + l)yi- However we again use the fact that 71 is 

J r® 

r/ , ■ . .. — is neg- 
(*+i)7i L( fe + l)TiJ 

ligible compared with the integral in (120). Hence we can safely alter 
the upper limit in (120) to °° and write 

2V 1 (-1)* e-'fi+'dl 

4 / ^ [(* + lhxf - f2 


= 2^(-l)* 


( P ) 

\2j + l/ 


“*77 [(* + D7i] 2 ' +a 

Finally we have for U'(j>, 71) 


r(2j + 2). 


( 121 ) 


r C2 i+2 ( P )T(2j + 2)1 

7i) = 7? + |1 /(p + 1) + 2g ^ ± ^ J , (122) 


where we have set 




(- 1 )* 

\&+2 — c i )+ 2 - 


(* + 1)* 

Using (122) in connection with (113) enables us to write 


2-JTT 

N = -^ (2 mkT) 


>■•*[! +*£ 


c ^ +2 ( 2; ^. 1 ) + 2 ) 


7 ? , +2 


] 


(123) 


(124) 
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For most applications it will be sufficient to confine our attention to the 
first term in the summation in the bracket in (124) and write 


N 


2 VT 

-jp (2 tnkD 


Now 


L3 TiJ 




(- 1 )* . 1.1 1 


1 ^2 "t" .8 


+ l) 2 2 2 1 3 2 4 2 U 


(125) 


(126) 


Consequently to this approximation, the equation connecting N and 
Yi becomes 


N = (2 tnkT)»y? [1 + ,r 2 /8 r 2 ]. 


(127) 


Recalling that y\ 2> 1, the first approximation to yi from (127) is 


2mkT 


(128) 


The second approximation proves to be 

( 3N\ * h 2 [ t ( 3N\~ * (2rmkT) 2 ] 

■" - kJ ■ SSrL 1 - KT„) ■ — iap— J- (129) 

The total energy is given by eq. (81) with a = 1. In terms of the 
general integral U'(p, y{) m (114) it is given by 


E = 


2*t 

1? 


kT (2mA IT ) 54 £/'(3/2,7i). 


(130) 


To the same order of approximation as that used in eq. (127) the 
result is 

AvrkT 


E = 


5 h 3 


(2 mkT)»yi* U + Sir 2 /8yfj 


(131) 


On dividing (131) by (127) and using the second approximation for 
7 x, i.e., (129), the energy may be expressed in the form 


E = 


3 Nti 
10 m 


...]. (-I 


The interesting difference between this result and that for an ideal 
gas on classical statistics is at once apparent. In the latter case we 
have simply E = 3/2 -NkT, and as T approaches zero, E also ap- 
proaches zero. For a strongly degenerate gas, on the other hand, 
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the energy approaches a non-vanishing value at the absolute zero. 
This is known as the zero-point energy of the gas. Its value is 


3_N#(ZN\» 
^ 10 m \4itt/ 


(133) 


The form of E in its dependence on T in (132) is also very different 
from that fos a weakly degenerate gas, given in eq. (87). Of course, 
we cannot let T —* 0 in (87) because the approximation which enables 
us to write (87) no longer holds for very small T. 

The existence of the zero-point energy is of sufficient importance 
for us to look upon it from another point of view. Of all the cells in the 
phase space associated with the gas only one can correspond to the 
energy value zero. On the Fermi-Dirac statistics there can not be 
more than one particle in this cell, if it is a particle without a spin. As 
we have seen in the discussion at the end of Sec. 4, there may be two 
electrons of opposite 6pin in each cell. For the moment, however, we 
confine our attention to ordinary gas molecules. Now at the absolute 
zero of temperature we should expect to find all the particles in the 
lowest energy states, i.e., all cells filled up to a certain maximum 
energy value. Consequently the total energy in this situation cannot 
vanish. This crowding of the particles into the lower energy states at 
T = 0 is another physical interpretation of the degeneracy of the gas. 
At T — 0 the degeneracy is complete. As the temperature rises some 
of the particles leave the lower energy states and are transferred to 
higher states; the degeneracy decreases. When the distribution of 
particles with respect to energy has become Maxwellian the degen- 
eracy has effectively disappeared and the gas is classical. 

The ideas of the previous paragraph can be used to give an alter- 
native calculation of the zero-point energy. We recall from Sec. 4, 
eq. (65) that the number of energy values for an aggregate of free 
particles lying between E } and E, + dE } is 

^?(2 m^V^dEj. 

If all energy levels from zero to some upper limit -EL— are each occu- 
pied by a single particle and the total number of particles is N, the 
maximum energy value is given by 

(2m) H J^ Ea “VEdE = N. 


(134) 
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Hie integration yields 


' 2m W ' 


(135) 


The total energy of the whole degenerate assembly then becomes 


o__ /• Umax 

Eo = -jjj- ( 2 wi ) M jT ^ dE 

3 JW> 2 /31V \ H 
10 m \4 itt/ ’ 


(136) 


in precise agreement with the value given in (133) and computed from 
(132). 

We can also calculate the entropy for a degenerate Fermi-Dirac 
gas. We have already noted that eq. (91) is general and does not 
depend on the degree of degeneracy. Substitution of m from (129) and 
E from (132) into (91) yields 

„ ■K 2 k 2 NmT(3NY H , . , . . . , 

5 = 1 I + terms involving higher powers of T. (137) 

h 2 \4tt/ 

As T — » 0, it follows that S~*0 and the entropy of a strongly degener- 
ate gas vanishes at absolute zero. 

The remarks immediately above about the physical interpretation 
of statistical degeneracy gain even more significance from a plot of 
Nj/rij as a function of velocity magnitude. For the non-degenerate 



gas (eq. 48) this, of course, is just the probability curve already shown 
in Fig. 2-1 of Chapter II. The corresponding plot for a strongly de- 
generate Fermi-Dirac gas is given in Fig. 8 • 2. For a very considerable 
range of values of the velocity v,- from zero up, N,/nj remains prac- 
tically constant at the value unity, indicating that all the lowest energy 
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cells are occupied, each with its single allowed particle. As the higher 
velocities are approached, however, NJtij drops more or less rapidly 
to zero, the rate of fall depending on the actual value of 71 and there- 
fore largely on the temperature. In this part of the curve the classical 
Maxwellian distribution is simulated to a certain extent. The, value 
of the velocity at which N,/tij = i-e., half the cells are occupied on 

the average, is 

»o = 



wherein we must use the value of 71 given in (128) or (129). The 
result of the substitution from (128) is 



The average energy per particle at T = 0 is, from (136), 

Eo = ± h 2 /3N\ H 
N 10 tn \4itt/ 


(140) 


This leads to a root-mean-square velocity at absolute zero of 

Vm = V| VO- (141) 


8. FERMI-DERAC STATISTICS OF A DEGENERATE ELECTRON GAS 

In the preceding sections we have not specialized the assembly of 
free particles to which the quantum statistical distribution formulas 
have been applied, save in so far as we have assumed them to be par- 
ticles without spin. In the present section we shall consider specifi- 
cally an electron gas in which the mutual interactions are neglected. 
To make the problem more specific we shall assume that the number 
of electrons per cm 3 is of the same order of magnitude as the Loschmidt 
number for an ideal gas. For example the gas might consist of the 
(hypothetical) free electrons in a metal in which on the average at 
room temperature there is one free electron per metallic atom. This 
will indeed form an interesting and important application of the theory 
of this section. The first thing we must note in the present discussion 
is that Nj in eq. (66) must now be multiplied by 2 in order to take 
account of the electron spin. This factor will follow through all the 
significant formulas. The easiest way to take care of its introduction is 
to replace t b yJtr wherever it occurs. 
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The next question to settle is this: Is the assembly of electrons 
degenerate or non-degenerate at room temperature? Going back to 
eq. (77), we recall that if t~ a — e yi (6 is equal to unity, of course) is 
very small compared with unity, the degeneracy is extremely weak. 
This is true for actual gases over a wide range of temperatures. For 
the electron gas, however, such as we might expect to find in the metal 
silver with one free electron per atom, substitution of m — 9 X 10 -28 
gram, N/r = 5.9 X 10 32 , T = 300° K, into • 

h 3 N 

e “ IrQnrnkT)* ' 


gives e~° of the order of 2,330, which is far from being small. Evi- 
dently for such an assembly even at very high temperatures e~“ will 
still be much greater than unity, indicating that below temperatures 
of the order of 10,000° K, an electron gas of the kind considered will 
be strongly degenerate and must be treated by the appropriate form 
of the Fermi-Dirac statistics. The formulas (129) and (132) are the 
ones we must apply. 

It is interesting to calculate the specific heat of such a degenerate 
electron gas. The specific heat per electron at constant volume is from 
(132) 


Per gram atom we have for the heat capacity 

LT 2 mk 2 (3N\-» _ RT*mk(3N\-» 

Lv ~ h 2 \8 wr) h 2 W ) ’ 


(143) 


where L is the Avogadro number and R the gas constant per gram atom. 
If now we compute the right-hand side of (143) for T = 300° K for the 
case of silver, already mentioned, we get approximately 

C v = 2.4 X 10- 2 it 

The interesting thing about this result is that the classical equi- 
partition principle when applied to an electron gas gives C v — 3R/2, 
independently of the temperature, which is a much larger value than 
that above at room temperature. If we are considering an electron 
gas in a metal with approximately one free electron per metal atom, 
the contribution of the free electrons to the specific heat of the metal 
at room temperature is therefore relatively very small. Practically 
all the observed specific heat is due to the atoms themselves. This 
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solves a fundamental difficulty that has always plagued the classical 
electron theory of metals. This difficulty was the one of allowing 
enough free electrons to account for the observed electrical properties 
of metals and at the same time keeping the specific "heat down to the 
experimentally observed value which at ordinary temperatures is given 
very closely by C® = 3R (law of Dulong and Petit. Cf. Chapter IX, 
Sec. 3). In the classical theory the two requirements seemed quite 
irreconcilable* Quantum statistics appears to solve the problem very 
nicely. 


9. BOSE-EINSTEIN STATISTICS OF A STRONGLY DEGENERATE GAS 


We have discussed weak degeneracy from the standpoint of both 
types of quantum statistics and strong degeneracy from the standpoint 
of the Fermi-Dirac statistics. What constitutes strong degeneracy in 
the Bose-Einstein statistics? The fundamental distribution formula 


for the latter type is 


, Airmr/h 2 • V2 mE, <LE, 
N > e ~^ /kT - 1 


For weak degeneracy 71 is negative so that e~ yi > 1 and the first 
term in the denominator outweighs the unity even for small values of 
E } . For strong degeneracy in the Fermi-Dirac statistics e~ T ‘ <5C 1. 
Evidently such a situation is meaningless in the Bose-Einstein case as 
it would lead to negative values of Nj for a considerable range of 
values of E,, Consequently we are led to the conclusion that the 
strongest degeneracy, i.e., the greatest deviation from classical statis- 
tics, possible for a Bose-Einstein gas corresponds to e~ yi = 1 or yi = 0. 
This has some interesting consequences. In the first place, the total 
number of particles in such a strongly degenerate Bose-Einstein gas is, 
from eq. (76) 

00 

N = 1 3 (2 rmkTf ■ ^ ^ ( 2 rtnkT)^, (145) 

Tl“ 1 


where the factor 2.61 is an approximation to the summation indicated. 
For a given volume of such a gas with a given number of particles, 
eq. (145) prescribes the temperature at which complete degeneracy is 
attained. A glance at (76) shows that if the gas were slightly less 
degenerate, i.e., e~° < 1 slightly, N would become smaller. By no 
change can it become larger than the value given in (145). Conse- 
quently we can make the statement that for a degenerate Bose- 
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Einstein gas the number of particles per unit volume has an upper 
limit which at temperature T is 

~ = ^ (2ir mkT) H . (146) 

This corresponds to a maximum equilibrium density for any particular 
temperature, a situation very different from that holding for an ideal 
gas. Any larger density than that given by (146) will necessarily cor- 
respond to a non-equilibrium state. 

We can evaluate the energy for the case of complete degeneracy by 
reverting to eq. (81) and computing E for yi = 0. The result is given 
by (82) with a = 0 and the choice of the lower sign throughout. 
Thus 

*-J? cww-^4- <>«) 

»- 1 
OP 

Since s l/» w = 1.34 approximately, the expression for E takes the 
approximate form 

-E = (147') 

The equation of state in the completely degenerate condition can now 
be found at once from eq. (94). It is 

P = (148) 


In other words, the pressure is dependent on the temperature alone. 
This is, however, the situation one encounters in the pressure of sat- 
urated vapor and suggests that the strongly degenerate Bose-Einstein 
gas behaves like a gas below its critical point, so that when the temper- 
ature is reduced to a low enough value a certain kind of “condensation” 
takes place, removing from the higher energy states a certain number 
of particles and transferring them to the lowest energy state where 
they make no contribution to the pressure of the gas, a possible effect 
in the Bose-Einstein statistics that is of course not present in the 
Fermi-Dirac statistics. This “condensation” phenomenon has been 
studied in detail by F. London l * and possible applications discussed, 
particularly to the interesting properties of liquid helium near its 
transition point at 2.19° K. 

11 F. London, Phys. Rev. 54, 947 (1938). 
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Equation (147') cannot be used to derive the expression for the 
specific heat at constant volume. For this purpose We must use the 
closed form expressions (72) and (81). From (81) we have 



f- f+^dwOW 


/£7i\ f* mV" 71 

\ar ) T J 0 («- 71+B -i) a 


du (149) 


where the integral certainly remains finite as 7 i — » 0. The factor 
( can be evaluated by differentiating eq. (72) (with a = — 1) 

r 

with respect to T. We obtain after conducting a partial integration 


/ fryi 

W 



rr 

¥ 


3N/2T 

/ * u~* 

J^Z\ du 


(150) 


However, as 71 — * 0 the integral in the denominator approaches 
infinity and hence in the completely degenerate state the specific heat 
is given by 

c - -(f),- if- (15,) 


From (146) combined with (147') we then obtain the approximate 
result 


C v = 1.92?. 


(152) 


10. STATISTICS OF A PHOTON GAS. RADIATION LAW 

Perhaps the most interesting application of the Bose-Einstein 
statistics of a strongly degenerate gas lies in the field of radiation. Let 
us consider a radiation field in physical volume r to be equivalent to a 
collection of light particles or photons corresponding to a wide range of 
frequencies. With a photon of frequency v is associated energy hv and 
momentum hv/c, where c is the velocity of light in free space. We 
desire the distribution formula for the photons with respect to fre- 
quency, i.e., for a radiation field in the equilibrium state corresponding 
to temperature T, the average number of photons with frequency 
lying in the range from vtov + dv. We shall assume that the photon 
gas is effectively a collection of free particles obeying the Bose-Einstein 
statistics and in a state of strong degeneracy at all temperatures. 

Obviously we cannot apply directly the distribution formula (144) 
with 7 x = 0, since there is no meaning attached to the mass of a photon. 
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However, we can rewrite (144) in terms of the momentum p, in place 
of energy in the numerator and so get rid of m. For a free particle 
we have at once 


E, 


A 

2m 


Consequently the distribution with respect to momentum (with 
yi = 0) 14 becomes 


_ 4xt p]dp 3 
1 " h* t E > /kT 


(153) 


If now we set p, = hvjc and E } = hv 3 , the above expression becomes 


N, 



v^jdvj 

e^ /kT -l 


(154) 


Let us make the further assumption that the photons, like electrons, 
possess a spin. This effectively doubles the number N } (cf. the corre- 
sponding situation in the case of electrons which we take account of 
by multiplying r by 2 wherever it occurs). When we leave off sub- 
subscripts and denote by dN the number of photons in the frequency 
range from v to v + dv, we have for the average number of photons per 
unit physical volume in the range mentioned 


dN 8wv z dv 

T = <?(e h ' /kT -l) 


(155) 


Since each of these photons has energy hv, the average energy density 
in the radiation field for frequency v becomes 


8irhv 3 dv 

dE ’ = ?(e h ’ /kT -l)' 


(156) 


This is the radiation law of Planck which has been found to be in sub- 
stantial agreement with experimental observations. Its derivation 
confirms that a photon gas may be considered to behave like a strongly 
degenerate Bose-Einstein gas. 

For very low frequencies or high temperatures (156) can be very 
accurately approximated by 

dE. = (157) 


14 It should be stated that in the case of a photon gas 71 = 0 follows at once 
from the simple fact that the number of particles is not fixed and hence the condition 
XtNf = 0 has no longer a meaning. The parameter 71 then really does not enter 
the problem. 
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or in terms of wavelength interval (X = c/v) 


dE, - 


SvkTdX 

X 4 


(158) 


Equations (157-158) are equivalent to the Rayleigh-Jeans law, which 
describes the distribution of energy in a radiation field rather accurately 
for long wavelengths. On the other hand for high frequencies or low 
temperature^ (156) takes the approximate form 

dE, = ~~ .t- h ' /iT dv, (159) 

c 


the well-known distribution formula of Wien. 

From the historical point of view it is interesting to recall that 
Bose’s original deduction of Planck’s radiation law was the starting 
point of quantum statistics. This is reason enough for reviewing 
Bose’s method 18 here, even though it is, of course, somewhat off the 
track of the systematic development of this chapter. 

In considering the distribution of objects in boxes or cells let us for 
the moment focus our attention on the cells rather than the objects. 
Thus of a total of Q cells we let Qo be the number which contain zero 
objects; Qu the number containing one object; • • • Q„ the number 
with j objects, etc., with 

if 

Q, = Q . (160) 



where N is the maximum number of objects per cell. The number of 
ways of selecting the Q cells so that Qo contain no objects, Qi one 
object, etc., is 


n = 


£! 

Qo'Qil — (?#! 


(161) 


We now proceed to make II a maximum subject to the conditions that 

N 

^ E 3 Qj — E — total energy = constant, 


J-0 


N 

^ Q] ~ Q = total number of cells = constant. 


(162) 


j-o 


Here E, must now mean the energy associated with the cell which 
contains j particles. Proceeding as usual in the case of a canonical 


» Z Phystk, 26, 178 (1924); 27, 384 (1924). 
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distribution (Sec. 2, Chapter IV) and calling the parameter @ = kT, 
we find « _ ■ ivt 

Q, - & '' 


N 


J-0 

The average energy per cell appears as 


(163) 


R,fkT 


N 


t5 * J-0 


Ea 

j-0 


(164) 


Now if the particles being distributed in the cells are photons we must 

assume that _ ., ..... 

E, = jhv. (165) 

Hence a 


£ = 


—jhr/kT 


But 


w 

2 > 


J-0 
lh*/kT _ 


1 


J-0 


1 _ g-WT ’ 


(166) 


(167) 


and if the maximum number of photons per cell is large, as we have 
reason to suppose will be the case, we can replace the finite sum in the 
denominator of (166) by the infinite sum in (167). If we differentiate 
(167) with respect to T, the result is 

hv/kT 2 -e~ h ' /kT 


-L hv/kT ..'\ 

kT 2 (1 - e" 


-h,/kTy 


from which (166) can be written in the form 

hv 


E = 


>/*r 


(168) 


If the photons are considered to form a gas of free particles in a volume 
r with momentum values hv/c, the average number of them in the 
energy interval hv to hv + d(hv ) will be, from Sec. 4, 

&ttv 2 (Iv 


( 169 ) 
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We have here assigned two photons to each energy value. Since the 
average energy per photon in this range is given by (168) it follows that 
the average energy density of the radiation in the frequency range from 
v to v + dv is given by (156), i.e., the Planck law again. 

11. APPLICATION 07 QUANTUM STATISTICS TO ATOMIC STRUCTURE 

One of the most striking applications of quantum statistics is the 
determination of the average distribution of charge in an atom. It will 
be recalled that in the nuclear atom model an atom is assumed to con- 
sist of a nucleus with a positive charge and most of the mass of the 
atom surrounded by an aggregate of electrons equal in number (for 
the neutral atom) to the charge on the nucleus. It is the problem of 
atomic structure to determine the possible energy values of such a 
system as well as the average distribution of charge considered as 
effectively continuous. The solution also provides the potential field 
in the neighborhood of the atom, which is important in many applica- 
tions of atomic structure. The complete formal solution is indeed 
possible only occasionally, i.e , when there is but a single electron, e.g , 
hydrogen, ionized helium. Various approximation methods have been 
devised to solve the problem for polyelectronic atoms. One of the 
most interesting of these was proposed by Fermi, 14 and independently 
by Thomas. 17 Their assumption is, in effect, that the electrons in a 
polyelectronic atom behave as a degenerate gas obeying the Fermi- 
Dirac statistics. If we assume that the electrons in an atom occupy a 
sphere of radius approximately 10 -8 cm, the density of the electron 
distribution will vary from 10 24 to 10 28 per cm 3 . From the considera- 
tions of Sec. 8 of this chapter, we see that such a gas will indeed be 
degenerate for all ordinary temperatures. 

The problem now in question differs from that previously discussed 
in Secs. 7 and 8 in one important respect, namely that whereas there 
we considered the electrons in the gas as being free and therefore 
possessing only kinetic energy or at most existing in a field of constant 
potential, we must now think of them as moving in a variable field and 
for this reason as possessing, in addition to kinetic energy, potential 
energy varying from point to point. It will be seen, however, that the 
only effect of this is to make the velocity distribution a function of 
position in space. In fact we can use the ordinary Fermi distribution 
law (eq. 47) with a = + 1 merely by incorporating with the energy E, 
the added potential energy —eV, where V is the potential of the field 

“ Z. Pkystk, 48, 73 (1928); 49, 55 0 (1928). 

11 Proc. Cambridge PM. Soc. 23 , 542 (1927). 



Sec. 11] 


APPLICATION TO ATOMIC STRUCTURE 


227 


and a function of x, y, z. Thus we can now write for the number of 
electrons having total momentum lying in the interval (p, p + dp) at 
the point (x, y, z) 

3 XT 8irr p 2 dp 

~ 1 Ijl e --n-eV/kT e tf/2mkT‘ (170) 

This follows readily from eq. (66) with the appropriate representation 
of Ej in terms of momentum. Strictly speaking it is necessary to 
generalize the derivation of n } in Sec. 4 to include the constant poten- 
tial energy term. When this is done, eq. (170) follows. The total 
number of electrons per unit volume at (x, y, z) then becomes (using 
the first approximation for N in eq. (127) with r replaced by 2r to take 
care of the spin and yi replaced by + eV/kT) 

” - 7 ■ + <m) 

It is now found convenient to set 

r = v + k ~^T ' (172) 

where v thus appears as the potential of the atomic field expressed to an 
arbitrary additive constant kTyi/e Equation (171) then becomes 


3\/’ r h 3 


(2vrme)*V*. 


The equivalent charge density, which we shall designate by p, is equal 
to — ne. If we treat this as a continuous static distribution, it must 
satisfy Poisson’s equation, viz , 

V 2 v = — 47 rp = (27 rmef^ev^. (174) 


As a first approximation we shall take the distribution to be spherically 
symmetrical, so that v is a function of r only, where r is the distance 
from the nucleus. Then 

_ d 2 v 2 dv „ a* ..... 

<1,5 > 

with /- 

C = 

The solution of (175) must be found subject to the boundary condi- 
tions /- 

lim rv = Ze; I ndr = Z, (176) 

r -o J 
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where Z is the number of positive charges on the nucleus, i.e., the 
atomic number, and the integration is to be extended over all space. 
The first of the above conditions expresses the fact that when an elec- 
tron is very close to the nucleus its potential is practically that due to 
the nucleus alone. The second condition expresses the fact that the 
total charge in the distribution corresponds to Z electrons, for the 
neutral atom. This will obviously not hold for an ionized atom, but 
the necessary alteration is fairly obvious and introduces no funda- 
mental change in principle. 

Equation (17S) may be considerably simplified by the substitutions 
x — r/(Ze)~* i C~ H and £ = »/(Ze)**C w . The equation then becomes 


dx 2 x dx ' 


(177) 


subject to the boundary conditions 

lim =1, f Z^xPdx = 1. (178) 

*-o J 0 

Finally we set <f>(x ) = x( and reduce (177) to an equation without the 
first order term, viz., 


d 2 <t> _ 
dx 2 y/x 


(179) 


The boundary conditions (178) take the final form 

/ QO 

y/x^dx = 1. (180) 

_ 

Eq. (179) has been solved numerically by Fermi and the solution is 
tabulated in the accompanying table. 



Table of Values 

OF IN (179) 



X 

*(*) 

X 


X 


4>(*) 

0 0 

1 000 

1 5 

0 315 

10 

0 

0 024 

0 1 

0 882 

2 0 

0 244 

11 

0 

0 020 

0 2 

0 793 

2 5 

0 194 

12 

0 

0 017 

0 3 

0 721 

3 0 

0 157 

13 

0 

0 014 

0 4 

0 660 

3 5 

0 130 

14 

0 

0 012 

0 5 

0 607 

4 0 

0 108 

15 

0 

0 011 

0 6 

0 562 

5 0 

0 079 

16 

0 

0 009 

0 7 

0 521 

6 0 

0 059 

17 

0 

0 008 

0 8 

0 485 

7 0 

0 046 

18 

0 

0 007 

0 9 

0 453 

8 0 

0 037 

19 

0 

0 006 

1 0 

0 425 

9 0 

0 029 

20 

0 

0 005 
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We can U9e this solution to obtain the potential v and the 
density p. Thus r 1 

and 


P = ne = - 

4 


C (Ze)* jH 






charge 

( 181 ) 

(182) 


This solves the problem of the statistical charge distribution in the 
atom. We should expect the resulting potential field to be a fair 
approximation to the actual field, and indeed one which improves as 
the number of electrons increases. As a matter of fact the approxima- 
tion is found to be a very good one for all atoms, except in the outer 
regions of the electron distribution. It is a very useful approximation 
in various applications of atomic structure, particularly those encoun- 
tered in the structure of metallic crystal lattices. 


PROBLEMS 

1. In a cube of gold, 1 era on a side, assume that the number of free electrons is 
equal to the number of atoms. Calculate a few energy values for the electrons for 
small values of n\, i»s, nj in eq. (59) and do the same for some larger values of these 
integers, e g , of the order of magnitude of 10*. Compare the spacing of successive 
energy levels in both cases. 

2. If the free electrons of the cube of gold in Problem 1 form a degenerate gas at 
absolute zero with all energy levels filled, calculate the maximum kinetic energy. 
Also calculate the total kinetic energy of the electrons. 

3. Calculate the total energy of one mole of oxygen at its critical temperature, 
treating it as a Bose-Etnstein gas 

4. At what temperature (order of magnitude) does the electron gas in metallic 
gold cease to be degenerate? 

5. Prove that the number of combinations of », objects N, at a time with all 
possible repetitions allowed is equal to the number of combinations of n, + N, — 1 
objects Nj at a time without repetitions. Hence derive the Bose-Einstein statistical 
probability (41"). 

6. Show that the isothermal compressibility of an ideal gas is equal to the recip- 
rocal of the pressure. Find the expressions lor the isothermal compressibility of a 
weakly degenerate Bose-Einstein and a weakly degenerate Fermi-Dirac gas. 

7. Use eq. (94) to find the approximate equation of state of a strongly degenerate 
Fermi-Dirac gas. Compare the isothermal compressibility with that of a weakly 
degenerate Fermi-Dirac gas. 

8. Derive the expression for the entropy of a strongly degenerate Fermi-Dirac 
gas out to terms of the order T*. 

9. At what temperature does the atomic heat capacity of the strongly degenerate 
electron gas in metallic silver become equal to the value predicted by the classical 
equipartition principle? 

10. Compute in tabular form the Fermi-Thomas statistical charge distribution 
in the neutral sodium atom in its normal state 
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SPECIFIC HEATS OF GASES AND SOLIDS 

1. SPECIFIC HEATS OF AN IDEAL MONATOMIC GAS 

The two specific heats at constant volume and constant pressure 
constitute important characteristics of a gas. In this chapter we dis- 
cuss in systematic fashion their theoretical evaluation for certain types 
of aggregates. They have already been defined in Sec. 4, Chapter III 
and discussed in preliminary fashion in Sec. 2, Chapter V. If the 
average total energy of the aggregate is E and its total mass is M, the 
specific heat at constant volume is 



and that at constant pressure is 



In this section we shall be concerned with an ideal monatomic gas 
consisting of N free mass particles of mass m and each possessing three 
degrees of freedom. We shall follow the Darwin-Fowler method in our 
discussion and use the energy expressions obtained in Chapter VII 
and Chapter VIII. For a classical ideal gas, we have 

e = ivr^iogzcr), (3) 

of 

where Z(f) is the partition function 

z d) O') 

3 

and f = e~ 1/kT . The E, in the sum are the possible energy values of 
any particle in the aggregate. If we transform from f to T, (3) becomes 
* 

E = NkT2 jf l ° S Z (T)> 
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( 4 ) 
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and the specific heat at constant volume is 

(5) 

From eq. (100) of Chapter VIII we can write the corresponding expres- 
sion for an ideal gas obeying the Fermi-Dirac statistics as 1 

<*•>’■> '<* C« 

The corresponding specific heat at constant volume for a Bose -Einstein 
ideal gas (from 104 of Chapter VIII) 

Mb.* — ^ [** Yr '°8 0 - *~'" / **>]/ (7) 

We have already used (6) or its equivalent in evaluating the specific 
heat of a degenerate electron gas (Sec. 8, Chapter VIII, eq. 142). We 
have also seen that for an actual gas composed of neutral molecules 
for which the Bose-Einstein statistics is indicated, the modification 
due to the use of the former in place of classical statistics is so small 
under normal conditions as to be negligible. Consequently in what 
follows we shall use the classical statistical formula (5) for specific 
heat calculations. 

The problem reduces essentially to the determination of the parti- 
tion function Z[T) given in the classical statistics by (3'). For an 
ideal gas this is given by eq (77) of Chapter VII. We write it again 
for convenience, 2 expressing it as a function of T. 

Z{T) = Y 3 (2*mkT)». ( 8 ) 

The application of (5) yields at once the familiar result 

3 k 

Cr = 2m ’ ® 

and the associated value of c v also follows directly. 

2. THE IDEAL DIATOMIC GAS 

The ideal diatomic gas consists of an aggregate of freely moving 
molecules, each of which is composed of two similar atoms. The 

1 It should perhaps be stressed that in (6) and (7) only the differentiation of the 
bracket is to be conducted at constant volume. 

* In this chapter we use V for volume. 
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determination of the allowed energy values of a diatomic molecule is a 
problem in quantum mechanics which we shall not work out here. We 
merely remind the reader that the energy of such a molecule is made up 
of three parts, viz., (a) energy of translation of the center of mass, ( b ) 
energy of rotation, and (c) energy of vibration. The possible energy 
values for translation have been worked out in Sec. 4, Chapter VIII 
and are 



If we simplify by assuming that the molecules are confined to a cube of 
side l and volume V = / 3 , the above expression becomes 

Et = g mV H (»? + + «!)• (10) 


Here j*i, n 2 , and n 3 can take all integral values. 

The possible energy values for rotation for a two-dimensional 
rotator are * 


E r 


Kj + DA 2 

8ir 2 / 


(ID 


where j takes on positive integral values and I is the moment of inertia 
of the molecule considered as a dumbbell. I is taken about an axis 
perpendicular to the line joining the atoms and passing through the 
center of mass. If indeed the molecule consists of two different atoms 
of masses wij and m 2 respectively separated by an equilibrium distance 
a, we have 



In the special case being considered in this section, — m and 


I = 


ma 2 

T" 


(13) 


The possible energy values for vibration are taken to be those for a 
simple harmonic oscillator of frequency v, namely (cf. eq. 53, Chapter 
VII), 

E v = <J + \)hv, (14) 

where j takes on positive integral values including zero. 

'Cf. for example, Lindsay and Margenau, "Foundations of Physics,” p. 435. 
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The total energy of the molecule will then be expressible as the sum 
of the energies E t , E r , and E v . The writing of the partition function 
(3') might appear to offer some difficulty because of the presence of 
the weight factors gj. These can be omitted explicitly, however, if we 
incorporate them into the energy levels themselves by agreeing to 
repeat in the summation terms which have weight greater than unity. 
We then write 

Z(T) = s e - ( *<+* r +*»>/M’ | . (15) 

the sum being taken over all values for all three energy types. Because 
of the exponential form, simplification is possible. Thus 

Z(T) = Ze~ Bl/iT -Ze- Er/tT -Z,e- E ‘ /kT , (16) 

or 

Z(T) = Z t (T) Z r (T) Z V (T). (17) 

We can then determine the partition functions for translation, rotation, 
and vibration separately and find the total function by multiplication. 

Let us first investigate the partition function Z r (T). From quan- 
tum mechanics it develops that each rotational energy state j has a 
weight 2j + 1 associated with it. Hence 

00 

Z r (D =2] ( 2 i + l)e- ,0+1,M/8 ’’" r . (18) 

7=o 

It turns out 4 that (18) with j allowed to take all positive integral values 
is the partition function for rotation for diatomic molecules composed 
of different atoms. If, however, the atoms are identical, i.e , the mole- 
cule is that of an element, not all values of j are allowed in the summa- 
tion ; rather only the odd values or the even values are permitted. The 
reason for this is connected with nuclear spin and is adequately 
explained by Mayer and Mayer in the reference just given. We shall 
denote the partition function for which j is allowed to assume even 
integral values by Z re (T ) and the one for the odd values by Z to (T). 
The evaluation of the sum in (18) is a matter of approximation, the 
ease of which is determined largely by the value of h 2 /Sir 2 IkT, which, 
following Mayer and Mayer, we call <r for brevity. The values of oT 
for certain molecules are taken from Mayer and Mayer and presented 
in the accompanying table. For most diatomic molecules n«l for 
room temperature and indeed <r < 1 for temperatures in the neighbor- 

4 Cf. Mayer and Mayer, “Statistical Mechanics,” pp. 150, 172. 
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hood of the boiling points, the only exception being provided by 
hydrogen. 

Table of o-T Values for Diatomic Molecules 


Molecule 

<jT (in degrees C) 

H, 

84 97 

I* 

0 053 

N, 

2 85 

0, 

2 06 

HC1 

14 95 


If <r is the order of 0.5 or greater 

Z r (T) = 1 + 3e -2 ' + 5e“ 8 ' + 7e -12 ' (19) 

is a sufficiently good approximation. For smaller values of a, Mayer 
and Mayer have used the Euler-Maclaurin summation formula and 
have obtained the approximation, convergent for o ^ 1 

ir,(D-i(i+r + ^ + ^+"-). (20) 


They also find that for small a, i.e. less than 0.2, 

Z r .(T) = Z ro (T) = \Z r {T). (21) 


The contribution of the rotational energy states to the specific heat 
can now be obtained at once from eq. (5) with the result that 


cv 


= £( 1+ s + ^ + ''’) 


( 22 ) 


for <r $ 1.0. This holds for small a independently of whether the 
atoms of the molecule are the same or different since the derivative of 
the logarithm of Z re or Z ro is the same as that of Z r (from 21). Equa- 
tion (22) possesses considerable interest. It shows that for high tem- 
perature (for which a — » 0), cy — * k/m. As a matter of fact this lim- 
iting value is closely approached even at room temperature for nitrogen 
and oxygen and is reasonably well approximated even for hydrogen. 
For these gases under these conditions, then, the total contribution to 
the specific heat from translation and rotation becomes (3/2 + l)i k/m 
— (5/2) *i k/m. This confirms the conjecture in eq. (32), Sec. 2 of 
Chapter V, which is well substantiated by experimental observation. 
It suggests that at and above room temperature the vibrational energy 
states contribute little to the specific heat of a diatomic gas. We shall 
look into this theoretically presently. In the meantime we note that 
as T grows smaller, a increases and the contribution of the rotational 
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energy to the specific heat increases. However with further increase of 
a, i.e. beyond unity, eq. (22) ceases to hold and a new expression based 
on (19) is indicated, but it is unnecessary to derive this in order to see 
that as a increases, ultimately Z r approaches the constant value unity 
and Cv approaches zero. We therefore expect at very low temperature 
little contribution to the specific heat from the rotational energy states. 
This agrees with the experimental observation that the specific heats of 
diatomic gases at constant volume decrease as tha temperature 
decreases. 

We now investigate the partition function for the vibrational energy 
states. From (14) this becomes 

00 

Z V {T) = 23 e~ (l+m,/kT . (23) 

7=o 


Here the elementary weights g, all reduce to unity, 
we can write 


Z V {T) - 

3-0 

' e -h./2kT 

= ! _ e -h,/kT- 


e -,h./kT 


Since e~ h,/kT < 1, 


(24) 


From this 


log Z V (T) = - ~ l0g (1 ” e ~ h ’ ,kT) ' 


(25) 


and eq. (5) yields, after carrying out the indicated operations 

* (hv \ 2 e h ’ /kT 

Cr ~m \kT/ (c h ’ /kT - l) 2 ’ (26) 

as the contribution of the vibrational energy states to the specific heat. 
At very high temperatures hv/kT becomes considerably less than unity 
and inspection shows that under these conditions cv —*■ k/m. How- 
ever at room temperature and below, hv/kT > 1, e.g., for H 2 at 300° K, 
hv/kT = 20. Under these circumstances 8 

k {hv/kT) 2 

C V hr/kT 1 

m e 

8 For data on values of the frequency r for diatomic molecules, see John C 
Slater, “Introduction to Chemical Physics,” pp. 132, 141, McGraw-Hill, New York, 
1939. 
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or a very small fraction of k/m. Compared with the contributions of 
the translational and rotational energies, that of the vibrational energy 
at room temperature is negligible. This of course finds confirmation in 
the experimental results already referred to. The increase in specific 
heat at very high temperatures due to the vibrational energy contribu- 
tion is also borne out by experimental measurements.' 

So far as the use of statistics is concerned the same method outlined 
above for the determination of the specific heats of diatomic gases is 
also directly applicable to polyatomic gases. The quantum mechanical 
problem of determining the energy states of rotation and vibration is 
of course much more difficult because of the greater number of degrees 
of freedom involved. 7 


3. SPECIFIC HEAT OF A CRYSTALLINE SOLID 


A crystalline solid is an aggregate of atomic particles whose posi- 
tions of stable equilibrium form a regular three dimensional array or 
lattice. The atoms do not remain at rest in their equilibrium positions 
but move about them in vibratory motion with frequencies determined 
by the nature of the forces acting between them. Specifically a solid 
crystal consisting of N similar atoms has 3N degrees of freedom, and 
from the classical mechanics of vibrating systems it follows that such a 
system can oscillate in 3 N harmonic modes. Quantum mechanically 
the possible energy values of the crystal are given by 


3 V 

En =^(w, + \)hv } . 


(27) 


j-i 


where the 3 N values of vj are the frequencies corresponding to the 
possible modes of oscillation and n, can take on all positive integral 
values for each j. Equation (27) is the generalization of (14). 

The partition function for the crystal becomes 


Z{T) =Y j e~*'' hT 

«**0 





(28) 


• Cf., for example, the data on CO quoted by Slater, op. cit., p. 14S. 

7 The reader will find an adequate discussion in Mayer and Mayer, op. cil., 
pp. 179 ff. 
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The exponential can be written as a product and hence Z can be 
expressed as a product of sums. Thus 


3JV » 

Z(T) = ns e -lnj+X>h*j/kT 

J “ 1 n,«0 

m g -k,,/2kT \ 

1 - e- h '> /kT )' 


(29) 


the second step being equivalent to that taken in (24). Further 
log Z(D = - X) (1 - e“ A '' /ir )]. (30) 




We now make the assumption that the same statistical method 
which we have applied to gases should also hold on suitable modifica- 
tion for the 3N particles of the solid crystal. Examination of the 
fundamental postulates in Chapter VII shows that this is a justifiable 
procedure. Only one slight modification is necessary. In our previous 
discussion of ideal gases the partition function referred to a single 
particle of the gas. Now the partition function in (28) refers to the 
whole assembly of oscillators. Consequently we must alter the fun- 
damental formulas (3) and (4) giving the energy of the system in terms 
of the partition function by leaving out the factor N, since it already is 
included in Z. Thus we now have for our crystal in place of (4) 




(31) 


The corresponding specific heat expression becomes 


Cy 


- 

Nm dT . 


r 2 — logz(r) 


- V 


(32) 


For the crystalline solid this takes the form (from 30) 

3N 

k d |„ 0 & 1 
cy 


k J_ 
Nm dT 




The result of the carrying out of the indicated operations is 

k ( hv ) /kT) 2 e hr,/kT 
Cr Nm 


Nm fzi (e h ’ ,/kT ~ l) a 


(34) 
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Let us suppose that the temperature is high so that for every v } , 
hv,/kT « 1. Each term in the sum in (34) becomes practically unity 
and the result is 



(35) 


A more accurate approximation is 





36 N jzi \kT/ J 


(36) 


The gram molecular heat capacity at high temperatures should thus 
approximate for all monatomic crystalline solids 

Cr = 3R = 5 96 cal/degree C. (37) 


This is the law of Dulong and Petit. For most monatomic crystalline 
solids it is in excellent agreement with experiment even at room tem- 
perature. 

At low temperatures (34) approaches the asymptotic form 

3AT 

cf = ^ (hv,/kT) 2 e~ h ’> /kT , (38) 


indicating that as the temperature approaches absolute zero, the 
specific heat should approach zero. .This is in general agreement with 
experiment though the precise rate of approach to zero is not in accord 
with the exponential law (38). 

The endeavor to render the theory of the specific heat of a crystal- 
line solid more exact necessitates the evaluation of the sum in (34) 
and this in turn demands a knowledge of the frequencies v,. The most 
important attempt at a solution of this problem is due to Debye. 8 
Debye replaces the sum in (34) by an integral over the whole frequency 
range from zero to a certain maximum frequency and seeks the dis- 
tribution of the oscillations among the various frequencies by finding 
the number of possible modes of oscillation of the crystal lattice in 
each frequency interval. 

Let us suppose that the crystal is replaced by an equivalent elastic 
solid medium taken in the form of a cube with side equal to l. The 
assumption was made by Debye that the actual lattice vibrations are 

• Ann. itr Physik 39, 789 (1912). Cf. also the discussion of the Debye theory 
in Slater, op. cit., pp. 222 ff. and Mayer and Mayer, “Statistical Mechanics,” 
pp. 248 ff. 
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the same as the allowed modes of elastic wave vibration (stationary 
elastic waves) in the equivalent continuous elastic medium. The 
latter are solutions of the wave equation for the two types of elastic 
waves in a solid medium subject to the boundary conditions at the 
faces of the cube. The two wave equations are respectively 9 


and 



(39) 

(40) 


Here S is the dilatation in the medium and eq. (39) represents the 
propagation of 3 as a longitudinal wave with velocity 


Cl 


4 


k + 4n/3 


(41) 


where k is the bulk modulus, n the shear modulus, and p the density. 
In (40) £ is a transverse displacement propagated with velocity 



We assume that the solid cube is traversed by plane harmonic waves of 
both longitudinal and transverse types. Such a wave will correspond 
to a displacement in the form 10 


Ae lr ' rt e~ {az+l,v+1 ' t \ 


(43) 


where A is the amplitude of the displacement and a, f), y are the direc- 
tion cosines of the normal to the plane wave front. The imposition 
of the boundary conditions assures that these progressive plane waves 
will become stationary plane waves with space part in the form 

. 2irr .2tv . 2vv 

sin — a* 'Sin — py- sin — yz, (44) 


' Cf., for example, Stewart and Lindsay, "Acoustics,” pp. 328 ff. D. Van Nos- 
traad Co., New York, 1930. 

w Cf. Slater and Frank, “ Introduction ' to Theoretical Physics,” p, 253, 
McGraw-Hill, New York, 1933. 
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where not all values of the frequency v and the direction cosines 
a, p, y are allowed, but only those which satisfy the boundary condi- 
tions, expressible in the form 


2rv . 2wv . 2vv 

Sin — at = sm — pi = sin — yl 
c c c 


0 . 


These are satisfied for 


nic „ «ac n 8 c 

" a = 17 ; * m !F n ~Ti 


(45) 

(46) 


where «i, » 2 , n 3 are any three integers. Since a* + j8 2 + y 2 = 1, 
(46) suffices to fix the allowed frequencies of plane stationary waves 
in the solid as 


III, n 21 ^”1 + ”2 + n 3- (47) 


To each triplet Bi, » 2 , n z corresponds an allowed frequency and a 
direction for the associated plane wave motion. The problem to be 
solved is to find the number of allowed frequencies in the given fre- 
quency interval v, v + dv. This is mathematically identical with the 
problem solved in Sec. 4 of Chapter VIII and we shall not need to 
present the analysis but shall merely give the final result, remarking 
only that for each direction we shall expect to find one longitudinal 
wave and two transverse waves. The total number of modes of plane 
wave harmonic vibration of the solid for which the frequency lies in 
the interval mentioned then becomes (with V = l 3 — volume of the 
cube) 

N(v)dv = 4tVv 2 dv. (48) 


We now follow Debye in the hypothesis that to a first approximation 
we can replace the real discrete distribution of frequencies in the 
actual crystal lattice by the continuous distribution (48). Since the 
total number of allowed frequencies is 3N, there must be an upper limit 
to v in (48). In fact we can compute this v mtx from the condition that 



3 N. 


(49) 


This results in 


9N 

4xF(2/cf + I/*?)' 


( 50 ) 
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We can use (50) to express (48) in the form 


N(y)dv = 


9 Nv a . 

K maz 


(51) 


The specific heat expression in (34) can now be written in the integral 
form 



Let us abbreviate by writing w = hv/kT, whereupon (52) becomes 


cy 


9k w*e a . 

Jo (e“- l) 2 W ' 


(53) 


It is usual to introduce the so-called Debye characteristic temperature 
defined by 


so that 


e„ - (54) 

_ ©i> /e -v 

Wiaax y, ‘ (-55) 


Equation (53) expresses cy as a function of T/@d. The integration 
will not be undertaken. However we can get one interesting result 
at once by noting that 



4ir 4 
15 ' 


(56) 


Hence for low temperatures where w mex 
the specific heat is given by 

. 12a -4 k 


Cv 


5 


m 


may be expected to be large. 


_T3 

02 ) 


(57) 


This dependence of the specific heat on the third power of T for small 
T is well substantiated for a number of substances. 

The accompanying figure (Fig. 9 • 1) gives a plot of cy as a function 
of T/Qjj. This should of course be the same for all crystalline solids. 
The distinction between various solids comes in the value of @i>. This 
characteristic temperature has been computed for several elements 
both from observed specific heat data and from the experimental 
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elastic constants (by means of SO). The values obtained by the two 
methods agree rather well. 11 For example, the value of 0 d for alumi- 
num calculated from the observed specific heat variation is 398° K 



and that obtained from the elastic constants is 399° K. Another 
possibility is to obtain by fitting the low temperature range as 
accurately as possible and not considering the higher temperatures. 
If this is done for aluminum the value is 385° K. H 

PROBLEMS 

1. Use eqs. (6) and (7) to derive the explicit expressions for the specific heat at 
constant volume of an ideal monatomic gas on the Fermi-Dirac and Bose-Einstein 
statistics respectively. Show that to a first approximation the classical formula 
ty — 34/2 m results. Estimate the magnitude of the correction for oxygen at its 
critical temperature, in the Bose-Einstein case 

2 . Derive the approximation (20) for the rotational partition function for an 
ideal diatomic gas. 

3. Prove that a plane harmonic wave of frequency v progressing in a direction 
with direction cosines a, p, y, has its displacement in the form 

{ = At arflrf-jCos+ft'+T*)], 

Find the expression for the displacement in the stationary waves which arise from the 
reflection of plane progressive waves of the above type in a cubical vessel of side /. 

4 . Calculate the Debye characteristic temperature for copper, silver, and alumi- 
num from the observed elastic constants. Compare the results with the char- 
acteristic temperature for the same metals obtained from the observed specific heat 
values at low temperature. 

11 For details, see Slater, op. at , p. 237 

11 For a discussion of recent attempts to improve on Debye’s theory by a more 
careful evaluation of the actual frequency spectrum of the crystalline solid, consult 
the book by Mott and Jones, "The Theory of the Properties of Metals and Alloys,’’ 
pp. 6 ff., Oxford, 1936. 
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5. Show that the entropy of a crystalline solid can be written in the form 

Find the result to which this reduces if all 3 N frequencies of the crystal are assumed 
to be identical and equal to *. Find the corresponding result on the Debye theory 
of frequency distribution. Show that on the Debye theory at temperatures low com 
pared with the characteristic temperature the entropy can be written approximately 

'—is©'- )■ ’ 

6. Derive the expression for the free energy of a crystalline solid on the Debye 
theory and show that at temperatures high compared with the characteristic tem- 
perature the value is approximately 

*-3N*r[log(^)-i + ^(^) ...]• 



CHAPTER X 


QUANTUM . STATISTICAL THEORY OF ELECTRICAL AND 
THERMAL PROPERTIES OF METALS 

1. THE LORENTZ-SOMMERFELD THEORY OF ELECTRICAL AND 
THERMAL CONDUCTION 

The most elaborate classical statistical theory of the conduction of 
electricity and heat in metals is due to Lorentz . 1 He attributed both 
effects to the motion of free electrons and treated the electron gas in 
the metal by the classical Maxwell-Boltzmann statistics. Sommerfeld 2 
modified the Lorentz theory by the introduction of quantum statistics. 
We have already noted that an electron gas such as the Lorentz 
theory envisages in a metal with approximately one electron per metal- 
lic atom, must be strongly degenerate even at very high temperatures. 
The distribution law to be used will therefore be that given in eq. (47) 
of Chapter VIII with a = -f 1, and with 71 given by eq. (129) of 
Chapter VIII. We find it convenient to express it in terms of velocity 
components rather than energy. This makes it necessary to rewrite 
the expression for n } in terms of the velocity components v x , v y , v, in 
place of energy. To revert to Sec. 4, Chapter VIII we see that the 
possible energy values of a free particle in a closed vessel, given in 
eq. (59), are really equivalent to saying that the possible kinetic 
energy values corresponding to velocity components v xi v y , v, along 
the three coordinate axes respectively are 

(Mk = , 

(W)-» = ^2 . 


(§*»$ n, 


nlh 2 

8m/f 


1 H. A Lorentz, "Theory of Electrons,” pp. 267 ff., New York, 1916. 

‘A. Sommerfeld, Z. Phystk 47, 1 (1928). See also "Handbuch der Physik,” 
second edition, Voi. 24, p. 33 3. Springer, Berlin, 1933. - 
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This should be clear from eqs. (55) and (58) of Chapter VIII. Hence 
the number of energy values for which v x lies between v x and v x + dv x is 
simply 


dni 

2 



h 


with similar expressions for the y and z directions. The total number 
of energy values for which v x is in the interval v x , v x ~\- dv x &n& similarly 
for v y and v, then becomes * 


dnidn 2 dna m 3 

g rdVxdVydvg. 


(i) 


If we assign two electrons to each energy value to take care of the spin, 
we have for our new n } the expression 




2m 3 

H 3 


t dVxdVydVf, 


( 2 ) 


and the distribution law to be used in our discussion takes the form 

2rm 3 /h a ■ dv x dvydv. 


dN — 

1 -f_ g-7l g m(iJ+e^+»J)/ 2 *r 

For the sake of convenience we shall mainly employ 
dN 2m 3 1 


fo(x, V X , Vy, v,) = 


2m 3 

rdv x dvydv , h 3 i -j. e -™ e m b\+'\+’\)/ 2kT 


(3) 


(4) 


The function / 0 is written as a function of x as well as v x , v v , v t since we 
are assuming that the metal in question is in the form of a rod directed 
along the x axis. We wish to express the fact that the rod may not be 
homogeneous, whence -yj will depend on x. Moreover if a temperature 
gradient exists, T will be a function of x. Hence the distribution func- 
tion will in general depend on x. 

If the metal is subjected to no external influences the distribution 
function /o will remain unchanged at any particular place in the metal. 
We must indeed expect the electrons to suffer energy changes by col- 
lision with the metallic atoms, but on the average as many electrons 
will be expected to gain as to lose energy and the fo should not be 
altered by this effect. However, the situation will be different when an 
electric field $(x) is applied to the metal. 

1 We revert to the use of r for physical volume. 



246 ELECTRICAL AND THERMAL PROPERTIES OF METALS [Ch. X 


Consider the electrons at a given instant t in a smalt volume dr of 
the metal at the point x, y, z and with their velocity components in the 
interval v x , v x + dv x , etc. At the time t + dt, if there were no col- 
lisions, these electrons would be displaced to the same volume around 
the point (x + Vydt, y + Vydt, z -f- v^dt) and their velocity components 
would become v x + e&/m-dt, v y , Note that we are assuming that 
the applied field acts only along the x axis. Now because of collisions 
with the atpms of the metal a certain number of electrons having 
velocity components in the interval in question at tiiqe t will have 
passed out of this interval by t + dt. Let this number per second be 
denoted by 

adrdvxdvydv,. 

Similarly we shall assume that 

bdrdvy/dvydv t 

have their velocity components brought into the interval per second. 
If there were no external field acting, equilibrium would be maintained 
by 

a = b. 

However in general we shall haw when a steady state is established 

e(o 

f(x + Vydt, y + Vydt, z + Vgdt, v x H dt, v y , v,) 


- /(*. V, *, Vx, Vy, V.) + (b - a)dt, 


( 5 ) 


where / is the generalized distribution function in the presence of the 
external field When the field vanishes / reduces to / 0 . Equation (5) 
leads to 


b — a — 


v tH + 'AHL. 

x dx m dv x 


( 6 ) 


df df 

Because / is a function of * alone, — and — are absent in (6). 

dy dz 

In order to utilize this equation it is necessary to obtain an expression 
for b — a. This involves a study of the collisions of electrons with 
atoms. Let us assume that the atoms are elastic spheres of radius 
R and that the number of electrons per unit volume is equal to n. The 
collisions of the electrons with the atoms are then elastic. In Fig. 10 1 
we have indicated such an atom with center at 0. Construct the solid 
angle du at 0 subtending the surface element dS = R 2 du at the surface 
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of the atom. Consider the electrons with velocity v in the velocity 
interval v x , v x + dv x , etc. The number of these which strike dS per 
second is the number in a cylinder of length v and base area dS - cos jS, 



where /8 is the angle between v and the inward drawn normal to dS. 
Consequently the number per unit volume is 

nR 2 du> cos 0 vf(x, y, z, v x , v v , vjdvgdvydv,. 


Hence the average number of collisions per unit volume per second 
suffered by the electrons in the given velocity interval would appear to 
be 


dvJ.v u dv I J' nR 2 v cos 0 f(x, y, z, v x , v v , v,)du. 


( 7 ) 


This assumes that all the collisions are possible, but this is true only if 
the gas is one obeying the classical Maxwell-Boltzmann statistics In 
the Fermi-Dirac gas, which we are considering here, only those 
collisions are possible for which the final state was originally empty. 
Consequently we must correct (7) by writing it in the form 


dv x dv v dv t J'\y \ — f(x, y,z,v x ,v v ,v t ) nR 2 v cos 0 f{x, y, z, v x ,v y ,v,)do> 


where v x , v v , v, are the component velocities after collision. The num- 
ber of collisions for which f{x, y, z, v x , v v , v,) = 2m s /h 3 then becomes 
zero. The integral in (V) is the expression for a in (6). 

Since we are assuming that the collisions are elastic we can calculate 
the velocity components after collision by the law of reflection on 
elastic impact. Thus, the velocity after impact being v' with com- 
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ponents v x , v y , v, (where | v' | = ] v |) the projection of v' — v on the 
normal to the surface element dS is 2v cos P, and hence if the direction 
cosines of the normal are X, p, v, we have 

v x — v x - 2v cos /9-X 

Vy — Vy = 2v cos p-p (8) 

v g — v, = 2v cos P-v. 

t 

This assumes, of course, that the atoms of the metal are stationary. 
Since they are so much more massive than the electrons, this assump- 
tion is reasonable. Similarly if the original velocity components are 
v' x , v'y, v, the final ones after impact will be v x , v y , v, as given in (8). 
Hence the expression for b in (6) is simply the integral 


f [1 — f(x, y, 2 , v x , Vy, v t ]nR 2 v cos P /(x, y, z, v x , v y , v,)dw (9) 


where the coefficient [1 — /(x, y, z, v x , v y , »*)] is inserted for the same 
reason as in (7'). Let us now assume that the distribution function as 
altered by the field can be expressed in terms of the original distribu- 
tion function f 0 by means of the relation 


/ = fo + v t -x(x, v t , Vy, Vg), 


( 10 ) 


where x» like/ 0 , is assumed t o be a functio n of the velocity components 
through the magnitude + rj only. This assumption is 

rendered plausible by the recollection that the field is directed along the 
x axis and hence should be expected to change / principally through v x . 
Equation (6) now takes the form 


/ df 0 e^(dfo , , dx\ 

Hte + v + ;ls +,H 


= nR 2 vx (x. 


,v) f { v x - v, 


) cos P du, 


(ID 


if we assume that x(*, v' x , v' y , v',) = x(x, v x , v y , v,), i.e., the total energy 
of the electron remains unchanged by the collision. If we consider x as 


a sort of correction function it will be proper to neglect v x ~ compared 

ox 


with ~ and x + v x — ■ compared with ~ 
dx dv x cVg 


Moreover we shall write 
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3p we use ® rst equation in (8). Then eq. (11) 

becomes 


dfo , e$ v x 

Vx T 1 

dx mv 


dfo 

dv 


= — 2 nR 2 i?xfa p) J' h cos 8 0 dw. 


( 12 ) 


We must now evaluate the integral on the right side. Let a be the 
angle between the normal to the atomic surface at dS anjl the * axis. 
Thus X = cos a. We now take the velocity v as the polaf axis of a 
system of spherical coordinates in which 0 is the co-latitude angle 
and <t> is, as usual, the longitude . By the application of the usual rule for 
the cosine of the angle between two lines we have 


Then 


X = cos a = — cos 0 -j- 

V 


dS 


+ Vj 


sin 0 cos <f>. 


da> — ~ ®in 0 d0 d<l>. 


Consequently the integral in (12) becomes 


/ 


X cos 2 0 du> 


/ w/2 

Si 

/'ft 


sin 0 cos 2 0 d0. 


vV -f u 2 

cos 0 H sin 0 cos <j> 

v 


^dtf>. 


where the upper limit of the 0 integration is r/2 instead of ir since we 
are clearly entitled to count in our calc ulation the solid angle for a 
hemisphere only. The term involving "V^d 2 + goes out in the inte- 
gration and we are finally left with 


/ 


X cos 2 0 du = — 5 • 
2v 


The resulting differential equation for / 0 then becomes 
dx mv dv 


(13) 


In line with our classical kinetic theory discussion in Sec. 3 of Chapter 
V, it is plausible to define the effective mean free path of the electrons, as 
far as collisions with the atoms of the metal are concerned, as 


l 


l 


vnR 3 


(14) 
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With this definition we can write (13) in the form 

_ _ if Vo e(?df(\ 

v \3* tnv dv ) 


(15) 


It should be pointed out that it is unnecessary to take the definition 
(14) too seriously. We could carry the quantity 1/irnR 2 through our 
analysis without giving it a special name if we chose. As a matter of 
fact, the theory being presented here is a formal one in the sense that 
we shall not attempt to calculate Z in a precise manner. The quantity 
1 /irtiR 2 has the dimensions of length and indeed if we insert reasonable 
values of n and R, e.g., for silver, ft — 5.9 X 10 22 , R <■»■' 10 -8 cm, l 
comes out of the order of 5 X 10 -8 cm, which is not unreasonable. It 
will later prove possible to deduce formulas from which l disappears by 
cancellation. We shall naturally be able to attach greater significance 
to such equations as tests of the theory. 

We are now ready to evaluate the electric and thermal current 
densities in the direction of the x axis. The electric current density or 
rate of flow of charge per unit area per second is by definition 


+<*> 

-‘Iff 


v x f dv x dv y dv,. 


(16) 


The thermal current density or the total rate of transfer of kinetic 
energy per unit area per second is 


+ 00 

- ! f/P 


v x f dv x dv y dv,. 


( 1 ?) 


Substituting / = / 0 + v x x and recalling that / 0 is an even function of 


v x , v y , v„ we get 


Similarly 


J = e fff £ X dv x dv, dv,. 

— 00 

C-f J'J'J’ vixv 2 dv , dvy dv,. 


(18) 


(19) 


When we proceed now to spherical coordinates, we replace dv x dv y dv, by 
v 2 sin 8 dv d9 d<f>, with i> 2 = -j- vj + vf, 8 the angle v makes with the v. 
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axis and the angle its projection on the v x v y plane makes with the v* 
axis. Integrating out the 6 and <t> parts gives 


and 


The next step is to substitute x from (15) into these expressions. The 
3 L Jo dx tn Jq dv J 


j “ *■ 

( 20 ) 

c - 2 ~r£’‘ xdv - 

( 21 ) 


C =- 


2irm 


[£ 


Vj&*+ 

dx 


m 




(23) 


An integration by parts performed on the second integral in each 
expression yields 

'-“-[If «« 


C - 


2irm 


It is convenient at this point to introduce the change in variable, 

Q 

mv 2 /2kT = u. Then the operator — becomes 


d_ 

dv 


mv d 

kfdu' 


while 


d 3 

dv — = du — • 
dv du 


kT 


Now for convenience let to 2 = L, whence lv z dv — — L du. Moreover 

m 

2 kT 

Iv* = i > 2 L — — - uL, etc., so that the expressions (24) and (25) now 
tn 

appear as 
4xe 


J = 


L m j o ou tn Jq ox j 

*S\!£.M r u 4 &. *1 (27) 
3 Lm tn Jo du 2 \ m I J 0 dx J 
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fo given in eq. (4) takes the form 

, 2m 3 

fo = TF 


-7»+» 


A 8 1 +e 

Hence on the assumption that in general both 71 and T vary with x 

(28) 


We can integrate 


bL 


bfo _ d / 0 / dyi , « 

dx bu \bx T bx) 

f* Vo 

Jo du 


Ldu by parts and find it equal to 


— / f 0 — du, since the integrated part vanishes at both limits. 
Jo bu 

Similarly 

P f uLdu =- f f* (uL)du. 

Jo bu Jo bu 

Finally, if we abbreviate by setting 

Jo bu Jo bu 


i fo-^~ du = A 3 


(29) 


the two current densities become 


J A x \e^-kT^f (30) 
3m L bx bx Ai. 


4ir kT 

C = T7 A * 

OfW 


j^e<£ — 


kT~ -k 
bx bx 


ar^] 

bx Avj 


(31) 


We are now ready to apply the formulas (30) and (31). If the medium 
is homogeneous and there exists no temperature gradient, (30) reduces 

■'-FF 1 - 


The electrical conductivity is defined as the reciprocal of the specific 
resistance, which is the ratio of the electric field intensity to the current 
density. Hence for the conductivity, a, we have 

J 4 T?A t 
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From (29) 


Ai 


2 kT 
m 


' r* w,, 

I Jo — — du. 
Jo 


du 


(34) 


We must assume that / is a function of u, though we do not know the 
precise dependence. Let us expand ul(u) in a power series : 


oo 

ui 


(35) 


and 


J -0 


d(ut) , 

- IT -2-,0 + 1 jay. 


du 


y-o 


Substitution into (34) yields 


2 j kT'ST' /** 

Ai - 2_^ 0’ + f 0 u 3 du. (36) 

We recognize h 3 /2m 3 ■ j fou’du as identical with U'(j, y{) in eq. (1 22) 
Jo 

of Chapter VIII. The first approximation yields 


_ 2kT 2m 3 ^ 

Al ~ m * “F 2 ^ 0 + (j + 1) ’ 
which from (35) becomes 


1 )B, 


■A +l 


IkT 2 m a 

Al = • “jTgf 7l*m). 


m 


(37) 


where l(y{) means the value of l(u) with yi inserted for u. The con- 
ductivity to the first approximation takes the form 

l6irtne 2 kT 


a = 


3h 3 


yiKyi)- 


(38) 


Note from eq. (138) of Chapter VIII that /(- yj) is the value of the mean 
free path corresponding to the velocity v Q . The utility of this formula 
for electrical conductivity is severely restricted by our ignorance of the 
dependence of l on the velocity. By using the first approximation to 
the degenerate form of 71 from (128) of Chapter VIII with 2 t in place 
of r we can rewrite (38) in the form 


a — 


8 ire' 
3A 


/3N\ h 

\8irr/ ‘ 


Ky i). 


(39) 
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In this formula the temperature no longer enters explicitly. The 
temperature variation of conductivity must thereforebe sought in that 
of N/t and of the mean free path l(yi). Since the change of N/r with 
temperature is a matter of thermal expansion only, it is extremely 
slight and scarcely competent to account for the change in a. Hence 
the burden is laid on f(yi). Calculations based on (39) using the 
experimental values of a indicate that for copper, for example, on the 
assumption of one free electron per atom l will vary from about 
7 X 10 -7 cm at 1000° C to about 4 X 10“ a cm at -200° C. These 
figures are larger than the value 5 X 10“ 8 cm computed from the 
simple formula 

*"3P 


with 10~ 8 cm for R. This is an indication of the formal character of 
the theory developed in this section. The assumption that the atoms 
are perfectly elastic spheres is clearly not a particularly good one, 
unless we are willing to assign to the radius of such spheres a smaller 
value than 10 — 8 cm. This is, of course, a possibility; the effective 
radius of an atom for a collision may be much smaller than the dimen- 
sions of the core of the atom from the quantum mechanical standpoint. 
The variation of l with temperature is another matter untouched by the 
formal theory. 

let us next proceed to the thermal conductivity. In eq. (30), 
suppose that 7 = 0, i.e,, zero electric current flow. The result shows 
that there must still exist an electric field intensity as long as a tem- 
perature gradient exists or the metal is non-homogeneous. Its value 
is given by 


’= kT — 
dx 


+ 


k *1*1. 
dx Ai 


(40) 


We substitute this into the expression for the heat flow (31) and obtain 


C = 


4 irkT dT j4j 
~3m * k lte[ Ti 



(41) 


The thermal conductivity k, which by definition is the ratio of C to the 
negative temperature gradient, becomes 
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To evaluate this we must compute A 2 and A a . From (29) we have 

*-vr[*rS? + * h ]*- <«> 

We now fall back on the expansion (35) for ul. By the use of this, (43) 
becomes 

. 2kT .^,1 . . 


A 2 ~ ~m~ [ /o £ O' + 2 )By +1 ] du. • (44) 

We now revert to the expression for U' in eq. (122) of Chapter VIII, 
and can write to the second approximation 

A, ~if if + 2 “ 

_ + ** ( 2IM + 4,1 (£).,+ 7i (lf=)..)]' (45) 


Next we must get A a as 




By proceeding as before, we have to the second approximation 


- jT[/o2J 0 + 3)5X+ 2 ] du 
2 kT 2m 3 f . . 

-- n r [■>„. + . 


• ^[ 7?/(7i) + 2c2 ( 67i/(7i > + *m (46) 

We are now ready to evaluate k. In the ratio A 2 /Ai we need to 
consider only the first approximation to A\, namely, that given in 
(37), since simple inspection shows that the terms in the next approxi- 
mation involve derivatives of l only, which we shall neglect throughout. 
However, in A% and A% we must consider that part of the second 
approximation which does not involve the derivatives. We finally 
obtain 

^4 2 A 7?f(7i) ~h 4cgl(7i) __ 4ca 

A i yiKyi) 71 Ti 
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and 


yi + 


8c;2 


■A 3 ^ -f 12ca7iK*yi) 

A 2 7ii(7i) + 4e fl /(y 2 ) 7i 

Consequently the heat conductivity becomes (with t 2 /12 for £ 2 ) 


l6irk 3 T 2 m 


yViyi) 


4 c 2 
7l 


i67r 3 ^ 3 r 2 w 


3h 4 ,1V,,/ 71 9 h 3 

* 

Let us now take the ratio of * to a T. The result is 

k 16ir*k 3 T 2 m 3h 3 7T 2 k 2 

cT ~ 9 h 3 \b*me 2 kT 2 “3c 2 ' 


7iK7i). 


(47) 


(48) 


We have thus derived the well-known Wiedemann-Franz law connect- 
ing the electrical and thermal conductivities of metallic conductors. 
If we use the electromagnetic unit of charge for e, the value of the 
constant on the right side of (48) turns out to be approximately 


«l k l 

3 e 2 


= 2 44 X 10 8 


°rgs 2 

(degree L) 2 (emu) 2 


(49) 


At room temperature (293° K) we have 


- = 7.15 X 10 10 
o 

in absolute units. This agrees rather well with the mean of the experi- 
mental values for such metals as Al, Cu, Ag, Ni, Zn, Cd, Pb, Sn, Pt, 
Pd, Fe. The following table (taken from H. Frohlich, “Elektronen 
Theorie der Metalle," Springer, Berlin, 1936) shows the nature of the 
agreement. It gives the experimental values of k/<tT for two different 
temperatures. 



Metal ] 

Cu 

Ag 

Au 1 

Zn 

Cd 

Pb 

Fe 



2 28 

2 36 

2 43 

2 31 

2 42 | 

2 45 

2 88 











Of these metals, iron is the only one to deviate notably from the 
theoretical value (49). In general the values for various steels show 
considerable increase with temperature. There are indeed more 
notable exceptions to the law. The constant for rhodium, for example, 
is only 1.33. In general, however, the law may be taken to be well 
established. 






Sec. 2] 


HEAT PRODUCTION IN A CONDUCTOR 


257 


It should be pointed out that though the classical statistical 
theory of Lorentz also leads to a law of the form (48), the value of the 
constant comes out to be 2k 2 /e 2 in place of v 2 /3-k 2 /e 2 . This is in 
definite disagreement with the experimental values for most metals 
measured and constitutes a serious objection to the Lorentz theory. 
The reader may show that, as might be expected, the Lorentz result 
can be obtained from the quantum statistical theory above outlined 
if one assumes a non-degenerate gas. This makes clear, the essential 
failure of the Lorentz theory from the standpoint of quantum statistics. 

At the same time it must be emphasized that the Sommerfeld 
application of the Fermi-Dirac statistics is by no means a complete 
picture. It dodges entirely the fundamental question of the mean free 
path of the electrons and treats the interaction between the electrons 
and the atoms of the metal in a purely superficial fashion. Only a 
thoroughgoing application of quantum mechanics can overcome these 
defects. As a first approximation, at any rate, the Sommerfeld theory 
must be viewed as a successful application of quantum statistics. 


2. HEAT PRODUCTION IN A CONDUCTOR. THE THOMSON EFFECT 

The work done per second by the electric field S when an electric 
current / flows through the length dx of a conductor is Sldx = SJSdx, 
where S is the area of cross-section. The power expended per unit 
volume is therefore JS. Denoting, as usual, by C the rate of flow of 
heat per second per unit area, the rate of increase in heat content per 
dC 

unit volume is — — • Hence the net rate of heat production per unit 
volume is 

dC 

(50) 

From eq. (30) we can solve for S in terms of J, etc. Thus substituting 
for Ai in terms of the electrical conductivity o from eq (33), we have 


kdTA 2 kTdyi 

J a H — — — - — I —• 

e dx Aj e dx 


(51) 


The next step is to substitute this expression for S in that for C 
(eq. 31). The result is 


4 vkT . 
C = — — A 2 


'eJ_ .dTfAa 


dX \j4a 



(52) 


3m 
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From eq. (42) we can solve for 



in terms 


of «e and 


get 



3m< 

i*k 2 TA 2 ’ 


(53) 


whence eq. (52) can be written 

AvkTeJAa &T 

3mo dx 


The quantity Q therefore becomes 


Q 



or, using (51) again, 


AvekJ d /TA 2 \ 
3m dx\ <r )' 


Q = 



+ 


JkT d ( A 2 \ 

e dx \ 1 AiJ 


(54) 


(55) 


(56) 


This is a very interesting expression. We recognize at once the term 
J 2 /o as the Joule heat production rate per unit volume. The second 

term, — ( k — ), is the rate of heat production per unit volume due to 
dx \ ox / 

heat conduction. It is clear that we must attribute the last term in 
(56) to the thermoelectric effects. Let us review these briefly. 

We consider first the Thomson effect. If a current flows in the 
conductor in the direction of the positive x axis and if a temperature 
gradient dT/dx is maintained in the conductor, it is found that for 
certain metals in the part of the conductor in which the temperature 
gradient is positive, heat is absorbed by the metal, while in the part 
where the temperature gradient is negative, heat is evolved by the 
metal. This is true, for example, in the case of copper, which is said 
to exhibit a positive Thomson effect. On the other hand, there are 
metals in which positive dT/dx is accompanied by the evolution of 
heat and negative d T /dx by absorption of heat. Iron is an example of 
this, and the effect is known as the negative Thomson effect. In lead 
the effect is so small that it is generally considered negligible. The 
Thomson heat, unlike the Joule heat, is reversible. 

Lord Kelvin (for whom the effect is named) discovered by experi- 
ment that the thermoelectric heat energy evolved or absorbed per 
second per unit volume of the conductor, i.e., the heat additional to the 
Joule heat, is directly proportional to the product of the current 
density and the temperature gradient. The coefficient will be denoted 
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by p and called the Thomson thermoelectric coefficient. Thus if we denote 
the Thomson heat by Qt, we have 1 

ST 

<2r=^~- (57) 

From eq. (56) it follows that for a homogeneous metal 

#* = Jr ( 7l ” A d A ^- • ( 58 ) 


Substituting the value for A^/Ai given just after eq. (46), we obtain 


kT A (i£?y 
e ST\y i) 


(59) 


We must now insert the value of yi for a degenerate electron gas 
given in eq. (128) of Chapter VIII (the first approximation being 
sufficient). We replace r by 2r, as usual. Carrying out the differ- 
entiation yields 

, , 2v 2 mk 2 T , 


M I * 


3eh 2 


/SirrY* 
\3W/ ‘ 


(60) 


The slight variation of r/iV with temperature is here neglected. We 
now discuss the comparison with experiment. If we substitute into 
(60) the value of N/t = 5.9 X 10 23 appropriate for silver and take 
T = 300° K, etc., we get approximately p = 1.5 microvolts/ 0 C 
(equivalent to 1 50 ergs/emu degree C) . The experimental value of the 
Thomson coefficient for silver is 1.2 microvolts/ 0 C, and in general the 
experimental values run between 1 and 10 microvolts/ 0 C. Hence 
there is general order of magnitude agreement with experiment. This 
is interesting since the original Loren tz theory, which is equivalent to 
the present theory for the non-degenerate or very weakly degenerate 
case, gives much higher values, indeed of the order of 100 micro- 
volts/ 0 C. In fact, the reader can show by the use of the non-degener- 
ate forms of 7 i and A*/ A\ in (58) that the magnitude of p on the 

3 k 

classical Lorentz theory is the universal constant - - , which should 

« € 

1 There are many so-called Thomson coefficients, only one of which is presented 

here. It is the one which is ordinarily measured. For a fuller discussion, see Sotn- 
merfeld and Frank, “Statistical Theory of Thermoelectric, Gaivano- and Thermo- 
magnetic Phenomena in Metals,” Remews of Modem Physics 3, 1 (1931). Athorough 
discussion of the thermoelectric effects from the standpoint of thermodynamics will 
be found in P. W. Bridgman, “The Thermodynamics of Electrical Phenomena in 
Metals," Macmillan, 1934. 
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then apply to all conductors. There is one “fly in the ointment," to be 
sure, namely, the fact that the Sommerfeld theory in the simple form 
here presented is no more able than the Lorentz theory to distinguish 
between the positive and negative Thomson effects, i.e., we can cal- 
culate ( n ( only. 

3. THE PELTIER EFFECT 


We next consider a non-uniform conductor maintained at uniform 
temperature. In eq. (56) the term in dT/dx accordingly drops out and 
the heat production due to the flow of current through the non- 
homogeneous medium is given by the last term with the understanding 
that T remains constant while the medium changes its character with 
x. Let us suppose we have two different metals in “contact,” which is 
taken to mean that they are separated by a very narrow transition 
layer. We shall define the heat produced per unit area of this layer 
per second by unit current density as the Peltier heat. It is denoted 
by jt 12 , the subscripts referring to the two metallic media Therefore 
we have 




dx 


kT ' 
e . 


71 


Aik' 


(61) 


where *1 and x 2 refer to the boundaries of the transition layer in the 
two metals respectively. Utilizing the expression for the bracket 
already employed in eq (59) we obtain 


iria 


fe7Vn** 
3«7i J*, 


2ir 2 mk 2 T 2 (N\~ h (NY*! 

3eh 2 \ 3 ) L\ t /2 “\t/, J’ (62) 


where ( N/t)i and (N/t) 2 refer to the values for the first and second 
metals respectively. The calculation for the case of the copper-silver 
couple at T = 300° K, yields approximately 100 microvolts. The 
agreement with experiment is not startlingly good, since the experi- 
mental value turns out to be about —30 microvolt, the minus sign 
signifying the evolution of heat as the current goes from Cu to Ag. 
Fortunately for the Sommerfeld theory, the result calculated on the 
old Lorentz theory is even further out of the way, being around 9,000 
microvolts. It is perhaps well to emphasize that the experimental 
values are not too certain, rendering the comparison between theory 
and experiment rather precarious. 

« 
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4. THERMOELECTRIC ELECTROMOTIVE FORCE. 

THE SEEBECK EFFECT 

We next attempt the calculation of the actual thermoelectric emf 
in an open circuit. 

Consider two different metals 1 and 2 (Fig. 10*2) with junctions 
Pj and P 2 . We wish to find the potential difference between A and B, 
maintained at temperature T, while Pi and P 2 are kept at temperature 


Fig 10 2. 

V and T" respectively. Since no current flows, eq. (40) holds for the 
field intensity and therefore there exists a difference in potential 
between A and B which has the value 

- f‘su-\ f rte* ( «) 

Ja e Ja dx Ai e J A dx 

We must evaluate this for the degenerate case and hence use A 2 /Ai — 
7i + ^ 2 / 37 i. This yields 

v * B = i efA' ,,+ i') dT + ~'l T? ai ix - (64) 

r B k 

Integration by parts of the second integral yields / y\dT, whence 

Ja « 

„ r f r dT r'dT f T dr] 

AB 3e Ja Yi SelJr Yu Jr 712 Jv 7i J ’ 

where 7 n is the value of 71 for the first metal and 7 j 2 that for the 
second. The use of the first approximation for 71 (eq 128 of Chapter 
VIII) leads to 

r *rn±) iT 

3e Jt< \Yi2 7u/ 


it 2 * 2 mk 




Furthermore if we neglect to a first approximation the dependence of 
r/N on T, the approximate result is 

^ (C] (r " - r2) • < 6,) 
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This agrees much better with experimental results than the correspond- 
ing formula for the classical Lorentz case which the reader may show is 
in the form 

F^r 0 * 1 = * [log log (^)J (T" - n. (68) 

Thus for the case of a silver-sodium thermocouple substitution into 
(67) gives fo{ T" — T' — 1 degree C at 0° C 

Vab = 1-0 microvolt, 

whereas the classical formula yields for the same couple under the same 
conditions 72 microvolts. The experimental value is approximately 3 
microvolts. The reader should work out other cases for himself. 

It may indeed be shown that (67) can be put into the form: 

Vab = 54 6K(t" , (69) 

where l represents centigrade temperature and K is a constant depend- 
ing on the couple. This is in line with the actual form of the experi- 
mental observations, though the numerical values do not agree in 
every instance. 

5. TRANSVERSE GALVANOMAGNETIC AND THERMOMAGNETIC 

effects 

Less well known than the thermoelectric effects just discussed are 
those produced in a conductor through which heat or electricity is 
flowing by the presence of a magnetic field. The phenomena par- 
ticularly studied are those in which the magnetic field is applied at 
right angles to the current or heat flow. Chief among them is the Hall 
effect in which the magnetic field applied transversely to a current- 
carrying rod produces a transverse potential difference at right angles 
to both field and current which is directly proportional to the product 
of the intensity of the magnetic field and the current strength and 
inversely proportional to the transverse width of the conductor. The 
proportionality constant is the so-called Hall coefficient. There are 
three other similar effects : (a) Ettingshausen effect, or the production 
of a transverse temperature difference by the interaction of transverse 
magnetic field and electric current flow, when the conductor is ther- 
mally insulated ; (b) Nemst effect, or the production of a transverse 
potential difference when heat flow takes place lengthwise of the con- 
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ductor; and (c) Righi-Leduc effect, or the production of a transverse 
temperature difference when heat flows lengthwise of a thermally 
insulated conductor. 

All these effects constitute transport phenomena which can be 
treated by the theory of this chapter. The fundamental equation (10) 
must be generalized to take care of an additional dimension as well 
as the force on the electrons due to the magnetic field. The resulting 
transport formulas for J and C are analogous to (30) an^, (31) respec- 
tively but naturally rather more complicated. We shall not present 
the analysis or the results here 6 but merely remark that the agreement 
with experiment is reasonably good and of the same general character 
as that already encountered in the thermoelectric phenomena. One 
interesting byproduct of these studies is the change in effective resist- 
ance of a conductor due to the applied magnetic field. 


6. MOTION OF ELECTRONS IN A CRYSTAL LATTICE. 

SIMPLE TREATMENT 

The Sommerfeld theory employed in the preceding sections assumes 
that the conduction electrons are free or at any rate move in a field of 
constant potential. It takes account of the interaction of the moving 
electrons and the atoms or ions of the metal only from the classical 
and formal standpoint of elastic collisions. Actually the potential field 
in which metallic electrons move is by no means uniform and a more 
thoroughgoing application of statistics to the electrical and thermal 
properties of metals must take account of this fact. Much recent work 
has been done in this field * but a thorough survey of it lies outside the 
scope of the present volume. However, a simplified special case may 
prove of some interest as an introduction to the modem theory of 
metals. 

In order to employ a statistical distribution formula like eq. (47) of 
Chapter VIII it is essential to know the possible energy values E , and 
the number of such values in any given energy interval. In our previ- 
ous discussion we have used the values appropriate to a set of free 
electrons. We now wish to see how they are modified by the motion of 
the electrons in a force field. The actual field of force encountered in a 
crystal lattice is a complicated affair. Nevertheless if the metal is in the 
form of a single crystal, the regular arrangement of the atoms assures 

' A discussion will be found in the article of Sommerfeld and Frank already cited. 

• Cf. Mott and Jones, “The Theory of the Properties of Metals and Alloys,” 
Oxford, 1936. Also cf. A. H. Wilson, “The Theory of Metals,” Cambridge, 1936 
and F. Seitz, "The Modern Theory of Solids,” McGraw-Hill, 1940 
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that the field will be of periodic character in space. We therefore 
examine the motion of electrons through a periodic potential field. 
The simplest case of this kind is the one -dimensional lattice shown 
schematically in the following diagram (Fig. 10*3) which plots the 
variation of the potential energy V(x) with x. In the hollows of the 


*1 



Fig. 10 3. 


square saw-toothed curve the potential energy is assumed to have the 
constant value F 0 and in the crests the constant value Fj. For 
simplicity we suppose the curve extends from — oo to + oo . The 
widths of the potential energy regions are 2/ 0 and 2/ t respectively. 
This is, of course, a highly idealized picture but it does provide a simple 
type of periodically varying potential energy. 

The Schrodinger equations (cf. eqs. 3 and 52 of Chapter VIII) for 
the two regions where the potential energies are respectively V 0 and Vi 
are 

d 2 d 8ir 2 »rt 

^ + -^-(£- F„)* = 0 (hollow) (70) 

d 2 d 8ir 2 m 

^ + — (E-V 1 )d = 0 (crest). (71) 

If £ < Foaswellas£ < Fi, it is clear that the solutions of both equa- 
tions will be exponential functions with real exponents. Such func- 
tions, however, cannot correspond to genuine transmission of electrons 
through the structure, for the terms in which the exponents are nega- 
tive will more or less quickly go to zero as x increases and hence corre- 
spond to very small probability of finding electrons very far to the 
right of any chosen origin, while the terms in which the exponents are 
positive will go to infinity as x increases, which has no meaning in 
quantum mechanics. We must therefore choose £ > F<> at least, 
though it will develop that we can secure transmission under certain 
conditions if £ < Fj. In fact the latter situation is more interesting 
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than E > V\. 
Let 


Let us therefore discuss the problem for V\> E> Fq. 


8ir 2 m 


(E - F 0 ) = hi 


8ir 2 m 

~w 


(Fi - E) = k\. 


(72) 


The solutions of (70) and (71) become respectively (eft eqs. 56 of 
Chapter VIII) 

P = A 0 e ,itx + S 0 e"‘ M (hollow), (73) 

p = Aif 1 * + Sie"* 1 * (crest), (74) 


where the arbitrary constants A^, Bo, A\, Bi are in general complex. 
The transmission of electrons through the lattice is handled by setting 
up the boundary conditions expressing continuity in p and dp/dx at 
each boundary surface. Differentiating and using the prime notation 
for differentiation with respect to x we get 

p' = ik 0 (A 0 e'** - B 0 e- ,hot ) (hollow), (75) 

p' - fciUje* 1 * - Bi« -ilX ) (crest). (76) 

We shall now denote the midpoints of successive hollows by I, 3, 5 • • • 
and proceed to express the p and p' values at any point in a hollow in 
terms of their values at the midpoints denoted by the subscripts 
1,3,5 • • • . Thus we get rid of the arbitrary constants by setting 

Pi - A 0 + B 0 , 

p'x = (A o — Bq), 


whence p and p’ for any point between 1 and the next boundary sur- 
face become 


P 


= Pi cos kox 4- — sin fax, 
k o 


p' = p[ cos k 0 x — k 0 Pi sin kox. 


(hollow) 


(77) 


We now denote quantities at the first boundary at the left by the 
subscript combination 12 (see Fig. 10-3) and at the right by the com- 
bination 21. Therefore there results 


Pi 

P 12 = Pi cos 4 t~ sin kohi 

«o 


pit = Pi cos kolo ~ koPi sin kolo- 


( 78 ) 
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In similar fashion the 4 and 4' at any point in the first crest region will 
be given in terms of fai and 4n immediately at the right of the bound- 
ary by 

^21 . 

4 = 4 2 i cosh k\x -{- — — sinh k\x, 

(crest) (79) 

4' - cosh k\x + ki4ai sinh k\x. 


Hence (to consult the diagram once more) 

4aa - fai cosh 2 Vi + — - sinh 2 hih, 

4 * 23 ~ ^ 2 i cosh 2k\li - 4 - k\ 4 / 2 i sinh 2h\l\, 


(80) 


Finally, we can employ (77) again to write 

4a = 4a2 cos Vo + s ‘ n Wo, 
*o 

4a = ^32 cos Vo — ko4a2 Sin Vo- 


(81) 


The boundary conditions are 

^12 = 1 ^ 21 ; ^12 = 4al 

423 ~ 4 32 1 423 = 4a2* 


(82) 


Our task now is to use eqs. (78), (80), (81), and (82) to express 43 and 
4a in terms of 4i and 4i- The resulting equations should hold for the 
state functions and their gradients at any two successive mid-hollow 
points and therefore tell the general story of the electron transmission 
through the periodic potential field. If we utilize the boundary condi- 
tions, eqs. (80) and (81) become 

4i 2 

423 = ^12 cosh 2k\l\ + — — sinh 2 Vx> 

*1 


and 


423 — 4 i 2 cosh 2V! + M 12 sinh 2Vi> 

4a = 428 cos ko lo + sin kolo, 
k 0 

4a = 423 cos kolo — ko423 sin Vo- ' 


(800 


(810 
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Eliminating ^121 ^121 'Psa, and i/4a from the six equations (78), (80'), 
and (81') yields finally 


'Pa = Pi cosh 2kil\ ■ cos 2k 0 l 0 + - sinh 2A 1 / 1 *sin 2^oj 


f cosh 2kili -sin 2kolo + sinh 2kili sin 2 kola + 7^ cos 3 &q/o 

KQ L \Kq k\ 


'Pa = Pi [ 


cosh 2^1?!- cos 2io/o -f 


Uki _ko\ 

2\k 0 kj 


sinh 2h\l\ • sin 2k^i 0 


] 


)]• 

(83) 


— koPi j^cosh 2kih sin 2k^o — sinh 2kyli cos 2 k 0 lo + ~ sin 2 £ 0 lo^j. 

(84) 

Because the structure we are considering is assumed to extend to 
infinity in both directions the relations (83) and (84) will hold for any 
two successive mid-hollow points and we can therefore write for any 
integral j 

• p j+2 = Ap, + ~ p'„ (85) 


Pj+s = Ap, — k 0 Cp„ 


( 86 ) 


where the A, B, C are the bracket expressions in (83) and (84). Exam- 
ination discloses that 

A 2 + BC= 1. (87) 


Hence we may introduce the angle W such that 
A — cos W = cosh 2kih ■ cos 2 k 0 l 0 + ^ 

B = yj~ sin W-, C = sin W. 


smh 2kil\-s,m 2k 0 lo- 
( 88 ) 
(89) 


Then (85) and (86) can be written 

p ,+ 2 = P, cos W + y^yj^p', sin W, 

pl + 2 = P, cos W - PoyM'Pj sin W. 


(90) 

(91) 
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From the symmetrical nature of the infinite structure we are able to 
write for every j 

h+2 = (92) 


where 6 is independent of j and is a function of the energy and physical 
parameters of the structure only. Then 


ti+z = «‘Vj. 

This means' that 

(92') 

^+*M +2 = i!jh = z. 

(93) 

where Z is a constant independent of j. If we now divide (91) by (90) 
we obtain after a little reduction 

Z = tk oVf- 

(94) 

The substitution of = ik in (90) then yields 

* 3 


^,+2 = <P 3 e lW , 

(95) 

which on comparison with (92) shows that 


e = if. 

(96) 


If now IF is a real angle, e' w has the absolute value unity and ^ J+2 and 
differ only in phase but not in magnitude. Here there is complete 
transmission of electrons through the periodic structure, since | ^y| 3 
measures the average density of moving electrons at the j'th mid- 
hollow point. On the other hand, if IF is a complex angle \p J+2 differs 
from in magnitude as well as in phase and there is a decrease in 
| i | 2 as one goes from one mid-hollow point to the next. This means 
that when IF is complex there is no transmission of electrons through 
the structure. Consequently we can look on cos IF as a function 
characterizing the possibility of transmission. For cos IF is real 
(cf. eq. 88) and hence we have transmission when 

| cos IF | ^ 1, (97) 

and no transmission for 

| cos IF | > 1. (98) 

If we plot cos IF as a function of the energy E from eq. (88) there will 
be certain energy ranges for which (97) will be true and electrons of 
these energies will be transmitted through the metal represented by 
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the periodic model in question; these will be separated by other 
energy ranges for which (98) is true and electrons of such energies will 
be denied transmission through the metal. In other words we can 
look upon the structure with the periodically varying potential field as 
a kind of filter for electrons. It is indeed analogous to electric and 
elastic wave filters which have been studied extensively.* 

The energy bands for which | cos W | ^ 1 give the allowed energy 
values for electrons which can actually move through the lattice, i.e. t 
those electrons which are of interest for conduction of heat and elec- 
tricity. 

In order to plot these energy bands, let us go back to the expression 
fo r cos W in eq. (88). When E = V 0 , k 0 = 0 and k t — ‘Vs^m/h 2 
V Vi — V 0 . Let us suppose for simplicity that h « lo and indeed 
to such an extent that we can replace cosh 2kfii by 1 + and 
sinh 2kfii by 2kfii to a sufficiently close approximation. Then cos W 
becomes 

cos W - (1 -f- 2*?/?) cos 2kol 0 + hh sin 2k 0 lo. (99) 

When ko = 0, this reduces to 

cos W = (1 -f 2k\%) + 2k\hl 0 , (100) 

which is certainly greater than unity. Consequently for electrons with 
energy in the neighborhood of Vo there is no transmission through the 
structure. On the other hand when the energy has increased suffi- 
ciently so that 2k 0 lo = ir/2, cos W becomes 

cosIT = (^-^)m. (101) 

which can be smaller than unity in absolute value. Thus, to take an 
illustration in which the dimensions are reasonable for a metal crystal 
lattice, let 2 1 0 be of the order of S X 10~ 8 cm and 2/i of order 10~ 12 cm. 
Let V 0 be of the order of 1 electron volt, while Vi is of order 10 to 100 
electron volts. Then cos W in (101) will be much smaller than unity 
and transmission is assured for the energy corresponding to 2kolo = it/ 2. 
When the energy is great enough so that 2k 0 lo — t, cos W again 
becomes greater than unity, taking the form 1 + as may be seen 
from (99). The result is that as E increases from Vo to Vj, cos W oscil- 

7 for elastic wave and acoustic filters with the theory treated in mathematical 
analogy with the foregoing, see R B. Lindsay, “The Filtration of Sound,” Parts 
I and II, Journal of Applied Physics 9, 612 (1938) and 10, 680 (1939). 
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lates in a manner shown graphically (of course only in qualitative 
fashion) in Fig 10-4. It will be seen that as E increases, cos W exceeds 
' unity at periodic intervals given approximately by 

2ko h = nv, n — 0, 1, 2 * • • . (102) 


For small energy ranges in the vicinity of the values corresponding to 
(102) there will be no transmission. These regions are indicated by 
cross-hatching in the figure. The corresponding energy values are 
given by 

n 2 h 3 


E — V 0 - 


32 mil 


( 102 ') 


It will be noted that the forbidden regions grow progressively nar- 
rower in width as the energy increases. This can be seen on exam- 



ination of (99). The unshaded regions in Fig. 10-4 correspond to the 
transmitted energy bands. The criterion (102) for the non-trans- 
mitted energy values has an interesting significance connected with the 
possibility of interpreting ko in terms of the de Broglie wavelength of 
the electron. For since the latter wavelength is A = h/mv, we see 
from (72) that 




2t 
A ’ 


and hence the wavelengths not transmitted are those in the immediate 
vicinity of 



A 


n 


(103) 
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If we used the parlance of electrical and elastic wave filters we should 
call the metal lattice here being considered a high-pass electron filter. 
Of course it is also a band-pass filter as well 

The question may be raised: What happens when E > Fi? The 
solutions then become harmonic in both crests and hollows. This 
can be readily accommodated to the solution above by merely taking 
k\ pure imaginary. Thus we set 

k\ = tk[ 9 

where k[ is real and positive. Then we have 

cos W — cos 2k[li cos 2kola — ^ ( + 7 ? ) sin 2k[li sin 2k 0 l 0 . (104) 

2 \fco »i/ 

If 2 k[lx is still small, however, we can use the same approximation as in 
the previous case with the same general consequences. Of course, as 
the energy increases very considerably 2k[li will eventually become 
large enough to lead to oscillations in cos 2k[li and sin 2k\l\ though at 
a slower rate than cos 2^o- At the same time the ratios k[/k 0 and 
ko/k[ will approach unity. The result is that cos W approaches 

cos W = cos 2(k[k -f k 0 l 0 ), (105) 

with the transmission of almost all energies, the non-transmission 
bands being reduced to negligible width compared with the transmis- 
sion bands. 

The structure considered in this section is idealized to the extent 
that it extends to infinity in both directions. One might suppose that 
the situation would be different in a finite lattice which corresponds 
more exactly to an actual metal crystal. However the analysis 
indicates that the energy bands for transmission are not seriously 
affected by the change to the finite case. 

In order to apply the ideas of this section to actual metallic lattices 
it is necessary to generalize them to three dimensions. This compli- 
cates the analysis considerably and we shall not embark on it here. 
The results are similar, however, in the sense that there are certain 
energy bands for which electrons are allowed to move freely through a 
metal crystal. This does not mean, of course, that an undisturbed 
metal in equilibrium will always have some current flowing through 
it; in equilibrium there will always be as many electrons moving on the 
average in any one direction as in the reverse direction. However the 
imposition of an electric field or a temperature gradient will upset the 
equilibrium and produce a net flbw in a certain direction. The theory 
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can also distinguish between good and bad conductors. In a bad con- 
ductor we have to suppose that all the lower allowed energy bands are 
filled with their complete quota of electrons, and that this is true even 
for the valency electrons which are the fastest moving electrons in the 
atoms composing the crystal. In order to produce a current it is neces- 
sary to transfer electrons in lower energy states to higher ones across 
the gap separating the allowed bands. If these gaps are sufficiently 
great it may take very considerable energy to cross them and the sub- 
stance will then act effectively like an insulator. If on the other hand 
the energy bands available for the fastest moving electrons are not 
entirely occupied, electrons can be transferred to higher but neighbor- 
ing energies without going out of the energy band and hence with 
effectively little change in energy and the substance behaves as a 
metallic conductor. 

7. APPLICATION OF STATISTICS TO MAGNETISM 

It is well known that an external magnetic field of intensity 
induces in a metal a magnetic moment per unit volume, i.e., an inten- 
sity of magnetization ^ which for weak fields is directly proportional 
to the field intensity. We can write 

& = X&tf, (106) 

where x is known as the magnetic susceptibility. If x > 0 and is a 
constant independent of the field for moderate fields the metal is 
termed paramagnetic, while if x < 0, it is called diamagnetic. Metals 
possessing a finite value of e^in the absence of a field and a suscepti- 
bility which varies with the field intensity are called ferromagnetic. We 
shall not be concerned with this type here and in fact shall concentrate 
our attention almost exclusively on paramagnetism. 

When a magnetic field of intensity changes the intensity of 
magnetization of a metal by d&, the amount of energy required per 
unit volume is the scalar product 

dU = offd*?. (107) 

When the field changes from 0 to &C the change in energy density is 
then (assuming (106)) 

U# - U 0 = - (108) 

This represents a gain or loss in energy of the metallic electrons which 
are responsible for the magnetism. For paramagnetic substances, there 
is a loss in energy in the field, the electrons tending to line up their 



Sbc. 7] APPLICATION OF STATISTICS TO MAGNETISM 


273 


magnetic moments parallel to and in the direction of the field. Dia- 
magnetism is associated with the effect of a magnetic field on electrons 
moving in closed orbits in which the magnetic moments induced by the 
field are antiparallel to the field. 

In quantum mechanics 8 it is shown that the magnetic moment of 
the spinning electron is 


__ eh 
11 4ir me ’ 


(109) 


where e is the charge on the electron in esu and c is the velocity of light 
in cm/sec. Strictly speaking to get the total magnetic moment of an 
electron in an atom one must add to (109) the magnetic moment due 
to the orbital motion of the electron. The valence electrons of metal- 
lic atoms, however, may be considered free to a first approximation and 
hence their whole magnetic moment may be taken to be that due to 
spin. We shall confine our attention to these free electrons. 

If an external magnetic field is imposed on a metal, it is assumed 
that the free electron spins are oriented either parallel or antiparallel 
to the field. Hence if E is the energy of an electron in the absence of the 
field, then its energy will be E — iijffi f it lines up parallel to the field 
and E + pJ^fif it lines up antiparallel to the field. 

The number of electrons in the energy range (E, E + dE) in the 
absence of the field is taken to be the usual Fermi-Dirac distribution, 
viz., 8 


N(E)dE = 


8 wr/h 3 rrpVlEdE 
1 + e~^ /kT ' 


(HO) 


This follows from eq. (66) of Chapter VIII with 2r in place of r and 
o = l. The notation otherwise remains the same as in Chapter VIII. 
We shall find it simpler to write 


with 


N(E)dE = D(E)f(E)dE, 


( 111 ) 


8 7r rm^V / 2 E 1 ..... 

D(E) = ; /(£) = 1 + e ~ yie s/kT‘ ( 112 ) 


On the average in the absence of a magnetic field half of these electrons 

■Cf., for example, Lindsay and Margenau, “Foundations of Physics,’’ p. 480. 
•The treatment here follows in the mam Frdhhch “Elektronen Theorie der 
Metalle,” pp. 145 ff. Berlin, 1936. 
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have one type of spin and half the other. Hence when the field is 
applied the number of electrons with parallel spin is 

NpdE = ~~f(E - »3t)dE, (113) 

At 

while the number with antiparallel spin is 

D(E) 

N a dE = -— /(£ + ^Jf)dE. (114) 

In these formulas the D(E) is left unaltered, since D(E)dE is the num- 
ber of energy values lying between E and E + dE and is unaffected 
by the field. 

Now at T = 0, all states with energy less than the maximum value 
<j>, which is equal to h 2 /8m ■ ( 3N/rr ) M (eq. 135 of Chapter VIII with 2r 
replacing r as usual for electrons), are occupied, while all states with 
energy greater than 4> are empty. If we agree to denote the maximum 
kinetic energy of electrons with spin parallel to the field by E v and the 
maximum kinetic energy of electrons with antiparallel spin by E a , we 
have 

Ep = <t> + Ea — <t> — (115) 

It is clear that there is an excess of electrons with parallel spin. In fact 
we can compute the excess as (note that for T = 0, f(E) = 1) 

* - , •«£ r^vTEtE. <m> 

2 2h 

Since <t > » for magnetic field intensities less than 0 5 X 10 8 gauss 
we evaluate the integral to a high degree of approximation as 

X = (117) 

Associated with the excess electrons is a magnetic moment per unit of 
volume of magnitude 

J=- (118) 


Consequently in so far as the magnetic properties of metals can be 
considered as due to free electrons, all metals should be paramagnetic 
and possess at T = 0 a susceptibility 

X = ix 2 DW/t. (119) 
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Actually this is not the case for many metals are diamagnetic. Never- 
theless there are some metals, e.g., the alkalies, for which the above 
assumption appears justified and which permit a comparison with 
experiment. The values are not given, of course, at T = 0, but an 
extension of the analysis given above shows that at low temperature 
there is very little variation of x with temperature and consequently 
(119) should apply fairly accurately even when 7^0. In utilizing 
(112) we have 

DM = 


The substitution of 4> = h 2 /2m 
ceptibility 



This can be further simplified by the introduction of the value of p 
from (109), whence 

X « 1.9 X 10" 8 (^) M , (122) 

where p is the density, A the atomic weight and n 0 the number of elec- 
trons per atom. The following table gives the comparison between the 
experimental values for the alkali metals and those computed from 
(122). It should be noted that the values in the table are mass sus- 
ceptibilities and are obtained by dividing x in (122) by the density. 
The values quoted are in multiples of 10 ~ 6 . 

Table of Susceptibilities of the Alkalies 


(Taken from Mott and Jones, "Properties of Metals and AlloyB,” 1936.) 



Li 

Na 

K 

Rb 

Cs 


x (computed) 

1 S 

0 68 

0 60 

0 32 

0 24 


X (observed) 

0 S 

0 51 

0 40 

0 07 

-0 10 

Honda and Owen (1912) 

x (observed) 

— 

0 65 

0 54 

0 21 

0 22 

Lane (1930) 


8irrm* i x / '2$ 


h 3 


( 120 ) 


/3N_\ H 

\8irT/ 


yields finally for the sus- 


The calculation of the diamagnetic susceptibility involves a con- 
sideration of the effect of the magnetic field on the orbital motion of the 
atomic electrons. The analysis is beyond the scope of the present 
volume. 10 

10 Reference may be made to Mott and Jones, op. cit , pp. 201 ff. 
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PROBLEMS 

1. Calculate values of the electron mean free path for sodium, copper, silver and 
gold from eq. (39) by using experimental values for the electrical conductivity at the 
temperatures -200“ C, -100° C, 0° C, 100° C, and 200“ C. 

2 . Calculate values of the electron mean free path for the metals listed in Prob- 
lem 1 from eq. (47) by using experimental values for the thermal conductivity. 

3 . Derive the expressions for the thermal and electrical conductivities using the 
classical MaxwcV-Boltzmann distribution function for/o. Show that the Wiedemann- 
Franz ratio in this case is 2 fe 2 /® 2 . 

4 . Derive the expression for the Thomson coefficient m on the basis of the classical 
Lorentz theory. Evaluate it for the case of silver 

5. Verify eq (69) and carry through the calculation for the thermoelectric emf 
of a copper-iron couple with junction temperatures 0“ C and 100“ C respectively. 

6. Verify eq. (87). 

7. Show that if the saw-tooth potential curve in Fig. 10-3 is replaced by one in 
which the variation in potential from one region to the next is continuous and very 
gradual the corresponding infinite linear lattice passes electrons of all energies. 
(Cf. the acoustical analogy in R B. Lindsay, J Acous Soc. Am , 12 , 378 [1941) ) 

8. Prove that the amount of energy per unit volume associated with the change of 
the intensity of magnetization of a metal by amount && by a field of intensity vtf is 

dV =4? d&. 
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EMISSION OF ELECTRONS FROM SURFACES 

1. SIMPLE STATISTICAL THEORY OF THERMIONIC EMISSION 

The emission of electrons from hot bodies, the so-called thermionic 
effect, is so well known and its importance in industry so well realized 
that it is unnecessary here to give a detailed description of the experi- 
mental facts. A good review of these will be found in the article by 
S. Dushman in “Reviews of Modem Physics,” 2, 381 (1930). The 
aim of the present discussion is the statistical derivation of the Richard- 
son equation giving the thermionic current as a function of the temper- 
ature of the emitting metal. The method used is closely allied to that 
employed in Chapter X for the study of electrical conduction in metals 
and it is assumed that the emitted electrons form in the metal a degen- 
erate gas obeying the Fermi-Dirac statistics. 

The emitting metal is assumed to be in a vacuum with the emit- 
ting surface plane and perpendicular to the x axis. We further sup- 
pose that only those electrons are able to leave the metal for which the 
velocities in the x direction exceed the critical value v xo . This corre- 
sponds to the kinetic energy which we shall designate as E c . It 

presumably will be a characteristic constant for the metal. Its physical 
significance will be discussed later. The thermionic current density 
will then be given by an expression like eq. (16) of Chapter X, except 
that in the integration only the velocity components in the y and z 
directions are allowed to run from — oo to + oo , whereas v x has the 
lower limit v xo • Moreover the general distribution function / is 
replaced by f 0 , since no external field is applied. The expression for the 
thermionic current density therefore is 

J " 2J ¥L„ £„ £ 1 + e ~-*i+ s/kT dv * d v v dv *' o) 

The evaluation of the integral is materially simplified by the fact that 
E in the integrand can no longer take on the value zero as in eq. (16) 
in Chapter X. In fact the minimum value of £ is E c . It develops that 
for temperatures actually employed in thermionic emission E e /kT— 
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7 i » 1, Consequently the unity in the denominator of the integrand 
can safely be neglected compared with e ~ yi+E/iT and J becomes 


2 mh ,, /•+* /■+• . ElkT . . . 

J = — - — e yi I I I v x e ' dv x dv v dv„ 

H J —oo */— ao »/»-* 


( 2 ) 


The integration over v v and v, is immediate from the well-known 

/ +* 

. e~ aut du = V it / a. The result is 

00 




4rem 2 kT 


• yi f v x e- m ^ 2hT dv, 

J*xo 


or on performing the final integration 


_ 4remk 2 T 2 Xc/tT 
J ~ h 3 ‘ 


(3) 


(4) 


If we were to use the non-degenerate form for 7 ! (eq. 68 of Chapter 
VIII with 2r in place of r since we are dealing with electrons), the 
result would show the thermionic current density as 


J = 


— JIL e - E ‘ /kT . 

T ' 2wm 


(5) 


On the other hand the more reasonable assumption of the degenerate 
form for yi (cf- the first approximation in 128 of Chapter VIII with 2 t 
in place of t, as usual) yields simply 


4irmek 2 T 2 

J = — W~‘ 

where for convenience we have set 


% 

(6) 

( 3N\ h 

i\8ir t) 

(7) 


Both eqs. (5-6) were set forth by Richardson, 1 who made the first 
elaborate studies of the phenomenon. It proved rather difficult to 
decide between the two experimentally, since the temperature depend- 
ence enters much more critically through the exponential term than in 
the multiplicative coefficient. Nevertheless (6) is the form which has 
come to be considered the better representation of the effect. 

We must, however, make clear the significance of the quantity 
<f>. At first its presence might seem rather anomalous since E c has 

*0. W. Richardson, “The Emission of Electricity from Hot Bodies,” Longmans 
(1921). 
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already been defined as the minimum kinetic energy for which elec- 
trons are able to leave the metal. The experimental interpretation 
of the exponent of t in the emission formula is that it represents (when 
its sign is changed and it is multiplied by kT) the work function or 
energy necessary to get an electron out of the metal. Now clearly E c 
would actually represent this quantity only if all the emitted electrons 
were originally at rest. However it is most natural to assume that the 
electrons which get out are those with greatest kinetic energy and the 
work to get one of them out will be the difference between E 0 and this 
maximum kinetic energy. We have already shown (eq. 135 of Chapter 
VIII, with 2r in place of r) that the maximum kinetic energy in a 
degenerate electron gas is 



(70 


But the comparison between (7) and (70 shows us that <f> = P — and 
our assumption then appears to be justified. The exponent — (E c — <£) • 
1 /kT in the emission law is consistent with the hypothesis that it is the 
fastest moving electrons which leave at any given temperature. 

In the derivation leading to (6) we have assumed that (E c — 4>)/kT 
is large compared with unity. It is now necessary to justify this. 
We have just seen that E e — 4> represents the work necessary to 
get one of the fastest moving electrons out of the metal, i e , it is 
the work function. Measurement indicates that in ordinary metals 
this is of the order of magnitude of several electron volts, but kT = 
1.37 X 10~ ia T ergs and, even for T around 1000° K, is a small 
fraction of an electron volt. Consequently our assumption is vali- 
dated. Since J > <t>, it follows that v x0 > Vo where » 0 = h/m- 
(31V/8 tt)* (cf. 139 of Chapter VIII) is the velocity at which the 
distribution function fo drops to the value Consequently v xa is 
farther along on the distribution curve (Fig. 8 • 2) than »o and corre- 
sponds to a part of the curve where the latter is approximately exponen- 
tial. For this reason we see that the distribution for those electrons 
which are concerned in thermionic emission is approximately Max- 
wellian. 

When we consider the exact experimental verification of (6) we meet 
an interesting situation. Evaluation of the coefficient ivemk 2 /k 2 in 
formula (6) gives 120 4 am peres/cm 2 (degree C) 2 approximately. 
This is just about double the experimental value for a number of pure 
metals, namely Ca, Mo, Ta, Th, and W. This discrepancy can be 
accounted for by assuming that at the surface of the metal there exists 
a potential barrier at which a certain percentage of electrons will be 
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reflected even when their kinetic energy is greater than the potential 
energy associated with the barrier. Such a situation is not possible in 
classical mechanics but proves to be realizable in quantum mechanics. 
To calculate the reflection coefficient it is necessary to review the 
quantum mechanical theory of the transmission of electrons through a 
potential barrier. 


2. TRANSMISSION OF ELECTRONS THROUGH A POTENTIAL BARRIER 
AND APPLICATION TO THERMIONIC EMISSION 

We shall assume as before that the plane emitting surface of the 
metal is perpendicular to the x axis. The Schrodinger equation 
(cf. 3 and 52 of Chapter VIII) for an electron with three degrees of 
freedom moving in a field of force which is a function of x only and is 
characterized by a potential energy function V(x) is 

V(x))+ = °. (8) 


We try a solution in the form 


where 


Here we have set 


f(x, y, z) = \p{x)e~'^ ky+lk,) , 

2 8r 2 mE v 2 8ir 2 mE, 

*v - h 2 

E = E x + E y + E t , 


(9) 

( 10 ) 

(ID 


and think of E x as the kinetic energy associated with the x component 
velocity, etc. If we substitute (9) into (8) the result is 


dfy(*) 8ir 2 W 
dx* 


[E x - V(x))Ux) = 0 


( 12 ) 


for the determination of t(x). 

The question now arises: How are we to represent the function 
ywyj V(x)? Since we do not know the 
precise variation in potential at the 
boundary of the metal we make the 
y«o x II simplifying assumption that V(x) = 0 

x ~ 9 inside the metal, while V(*) = V 0 

Fig. ll-l. 'everywhere outside the metal. This 

' corresponds to the discontinuous 

jump in potential energy schematically sketched in Fig. 11-1. The 
region I to the left of the origin corresponds to the metal and the 
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region II to the right of the origin corresponds to the vacuum outside 
the metal. The boundary is chosen at * = 0. The SchrSdinger equa- 
tions for regions I and II then become respectively 


d 2 Hx) 

dx 1 

d 2 t(x) 

dx 1 


+ 


8ir 2 m 

~1F 


E x + = 0 , 


(I) 



W = 0, 


(II) 


(13) 

(14) 


where E x is of course positive. The general solution of (13) is (cf. eqs. 
55 and 56 of Chapter VIII) 

*i - (15) 

with k\ — 8* 2 mE x /h 2 . If is assumed to be a harmonic function of 
the time we can interpret Aie~ ,kix as a plane harmonic wave pro- 
gressing from left to right while Bie tilX corresponds to a similar wave 
in the opposite direction. This interpretation makes for greater 
picturesqueness in the discussion. The solution of (14) depends on the 
relation between E x and V 0 . Let us first assume that E x > V 0 . The 
general solution of (14) then becomes 

* 2 = A*’** + Bit'**, (16) 

with k\ = 8x a m(E x — V 0 )/h 2 , where we shall employ the wave inter- 
pretation as in (15). The functions and must satisfy the bound- 
ary conditions 

*i(0) - *(0), (17) 



In other words there must be continuity in both the $ function and its 
gradient in crossing the boundary. The use of (15) and (16) in (17) 
and (18) yields 


Ai + B\ — j4 3 + B 2 

(19) 

A x - 2*i = ^ • (A a - B 2 ). 

(20) 


If we assume that the region II extends indefinitely to the right there 
will be no f wave function corresponding to motion ih the negative * 
direction in II. We must therefore take B 2 — 0. With this choice the 
solution of (19) and (20) leads to the ratio of reflected to incident charge 
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density (cf. the physical meaning of the p function, Sec. 
VIII) 

\*r\\ j 2 (fel - h? 

|^| 2 OT (* l +* 2 ) 2 ’ 


1, Chapter 
( 21 ) 


while the corresponding ratio of transmitted to incident density is 


Pt 




A*\* 


4*? 


^il 2 


(*i + *2) 2 


( 22 ) 


To get the relative average rates of flow of charge we must multiply 
the charge density in each case by the electron velocity. To the left 
of the boundary this is 


J2E X 

»i = \ 

’ m 


* 1 h 
2-rrtl ' 


(23) 


whereas to the right of the boundary 



*2 h 
2irm 


(24) 


If we define the transmission coefficient D as the ratio of the average 
rate of flow of charge away from the boundary in II to the average 
rate of flow up to the boundary in I, we obtain 


4*1*2 

(*1 + * 2 ) 2 


(25) 


Similarly the reflection coefficient R is 


(*1 ~ h ) 2 
(*1 + * 3 ) 2 ' 


(26) 


We see that R + D = 1, as should be expected. It will be observed 
that unless *1 = * 2 , i e , Fo = 0. there is always some reflection, which 
increases in amount as F 0 increases. In fact as Vq — > E, * 2 — > 0 and 
R — » 1. The distinction between classical theory and quantum me- 
chanics is well illustrated by the fact that whereas on classical mechan- 
ics all electrons with energy larger than F<j should be able to climb to 
the Vo level or plateau, quantum mechanics predicts some reflection 
even in this case, a reflection that, to be sure, decreases as rapidly as 
E — Vq increases. 
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It is left as a problem to the reader to solve the transmission prob- 
lem for E < V o and to show that 


h 2 4k! 

2 ki + kf 


(27) 


However, the velocity in II now becomes imaginary and hence there 
can be no average flow of charge through II. Indeed the wave function 
in II is an exponential function of x with a negative exponent. 

We are now ready to apply the theory of this section to thermionic 
emission. The fundamental eq. (1) must be rewritten to take account 
of the transmission coefficient which is a function of v x . The thermi- 
onic current density now becomes 


J = 


2 m 3 e f + * /*+" v x D(v x ) 

h a J-„ X» J txa 1 + e-^ s/kT 


dv x dv v dv. 


(28) 


where D(v x ) is given by eq. (25). If we employ the same type of 
approximation used in the first part of the section, J becomes 

j m *SgL<nr D(E x )e~ Ex/kT dE x (29) 

» Js, 

The transformation u = (E x — E e )/kT changes this to 

J = g l*-B c)/ kT . £ D{u)e ~u du (30) 


Now if F 0 = E e , eq (25) yields 

_ . . WukT y/ E e + ukT 

D(u ) = - 7 == — . 

WukT + VE e + ukT } 2 


(31) 


From what has been said previously kT/E c « 1. For values of 
u > Ee/kT we get very little contribution to the integral in (30) 
because of the exponential factor e~“. T herefore the evaluation can 
best proceed by expanding in powers of WukT/E c . We have 


r 


D(u)e~ u du - 4 


jf 


e~“ Vukf[l+ukT/2E e + • 1 du 
WE e WukT/E, + VT~+ ukT/E e ] 2 ’ 


(32) 


After a little reduction this becomes 



~ u du = 4 



3 ukT 
2 E c + 
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'du — 1 and 


Utilizing the fact that / s/u e~ u du = s/r/l, / ue~' 

Jo Jo 

y* u h e~ M du = 3\/t/ 4, we can finally write to the indicated approxi- 


mation from (30) 

4irem£ 2 F 2 


4 v*r/£ B 


■[v*/2 - 2\/*r/s e - jj v; • + ...]. 


(33) 


The factor W kT/E e • [V^/2 - lVkT/E e . • •] can be considered as 
an effective average transmission coefficient and denoted by D{u). If 
the temperature is 1000° K, AT is approximately 0.1 electron-volt and 
if 2? e j= 10 electron-volts, kT/E e = 0.01. Substitution indicates that 
here Z>(u) = 0.27 approximately. While this is not equal to the 0 50 
apparently demanded in order to malm the theoretical J agree with 
experiment for a number of metals, the order of magnitude agreement 
is sufficient to indicate that the method of explanation is at any rate on 
the right track. It is very unlikely that the abrupt potential barrier 
visualized in Fig. 11 -1 is actually realized in fact. It is much more 
likely that the situation is that depicted in Fig. 11-2 where the transi- 
tion from V — 0 to V = V 0 takes 
place more or less gradually. This 
will decrease the reflection for given 
excess energy , i e., E x — E e , and 
hence increase D{u). However, the 
analysis will in general be rather 
complicated. Moreover the exact 
shape of the transition curve in Fig. 1 1 • 2 is unknown anyway. How- 
ever, the value of the amplitude term in the thermionic current varies 
very widely and values higher than 60 amperes/cm 2 (degree C) a are 
not uncommon. 

In the preceding discussion we have neglected a force whose influ- 
ence is to round off the course of the potential energy function as 
indicated in Fig. 112. This is the so-called image force from electro- 
statics. It corresponds to an attraction of magnitude e 2 /4x 2 when the 
electron is at distance * from the interface of the two media. Associ- 
ated with this is a potential energy function V — — e 2 /4x. We shall 
consider its use in Sec. 4. 



v=v„ 


Fig 11 2. 
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3. TRANSMISSION OF ELECTRONS THROUGH A POTENTIAL HILL 
AND APPLICATION TO THERMIONIC EMISSION 

It is conceivable that if the surface of a metal is coated with a film 
of some foreign substance the nature of the potential barrier will be 
materially altered from that shown in Figs. 11*1 or 11-2. Indeed it 
may look rather like Fig. 11 3 , where 
the potential rises at the surface of 
the metal to F = Vo and after the 
thickness l is traversed, falls to the 
value V = V\. Let us compute the 
electron transmission through such 
a hump. We shall assume that 
E x > Vi though it need not be 

greater than F 0 , as will be seen. The solutions of the one-dimensional 
Schrodinger equation for the regions I, II, and III are respectively, 
with E x > V o > Vi ^ = Aie -** _f_ 

*2 - Ate-'** + B***, ( 34 ) 

* 3 = Ate-'**-", 



| m 

r-r, 

F«=o I 

■»-+-*{ 
JL l__ 

hi 


a?«*0 


Fig 11*3. 


x 


2 8 * 2 mE, 

where k x ~ — — 


8ir 3 tn ... , 8v 2 m 

k 2 — - {E x — F 0 ), and k 3 ~ ~rx~ {E x — Fi), 


l 2 r U/» a-nxA /v 3 — 2 

", n 

and k\ , k 2 , and k 3 are all real. The term B 3 e' ktt has been omitted for the 
usual reason. The factor (x — J) is used in place of x in \ff 3 merely for 
convenience. The boundary conditions at * = 0 are 


Ai + Bi — A 2 + B 2 , 


ko 

- (35) 

A 1 -B 1 =-f-(A 2 - B a ), 


A a r* * + S 2 e* w = j4 3 , 


A 2 e~'*> 1 - B 2 e ,w = ^ • A a . 

* 2 

(36) 


and at * = l, 


The problem is to eliminate 5j, A 2 , and B 3 between (35) and (36) 
and obtain A 3 in terms of A\. We shall find it convenient to set 
k a /ki = &21 and k 3 /k 2 = k 32 . Eliminating B\ between the two equa- 
tions (35), we get A\ in terms of A s and B a . We can solve (36) for 
A a and B a in terms of A 3 . The result is 

A Ml , 

8 (1 + *«i)(l + k 3a )t** + (1 - 4ji)(1 - k 32 )e 


( 37 ) 
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On expansion this can be written 


As = 


2Ai 


Therefore 


(1 + * 21 * 32 ) cos * 2 ! + *(*21 + * 82 ) sin ^ 

2 4 

* (*21 + £32 ) 2 + (*21 - 1 ) (*32 — 1 ) cos 2 kal 


(38) 

(39) 


To get the transmission coefficient we must introduce the electron 
velocities in regions I and III namely 


*1* *3 h 

2vm ' 2iem 


Proceeding as in Sec. 2, and setting *s/*i = £31 gives for the transmis- 
sion coefficient 


A 


4*31 

(*21 + ^ 32 ) 2 + (*ll - 1)(*82 — 1) COS 2 k 2 l 


(40) 


Before considering D\ further, let us examine the case in which F 0 > 
E, > Vi. Then since * a is now pure imaginary, we find it convenient 
to set * 3 = 0%, *21 = »* 2 i. and * 32 = — t **2 where k%, **1 = **/*i> 
and k* 2 = * 3 /*a are all real and positive. Substitution into eq. (37) 
gives after rearrangement of terms 


As = 

This leads to 


2At 

(1 + * 21 * 32 ) cosh *2 / — t(*2i — **2) sinh k*l 


(41) 


A a \ 2 4_ 

^4i| 2 ~ (1 + *g)(l + *S) cosh 2 ktl - (& - k$ 2 ) 2 ' 


(42) 


The interesting thing about this result is that in spite of the fact that 
E x < P 0 , there is still transmission through the potential barrier. 
Indeed the transmission coefficient is 


.* L* 


D, = 


(1 4- *$)(! + ktl) cosh 2 ktl - (k* 2X - k* 32 ) 2 ' (43) 


If we resubstitute in terms of the actual energies, the result is 


D a = 


4 V (E x - Pi )/E x 




(44) 
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The transmission vanishes for E z < V i < F 0 , as is evident from the 
remarks made immediately following eq. (27). However (44) indicates 
that the transmission coefficient rises from zero to a definite value as 
E x goes from V\ to F 0 and thereafter increases as E x increases. A 
rough plot of the behavior of D as a function of E x is shown in Fig. 



11-4. This is, of course, a composite of D 2 (from Fj to F 0 ) and D\ 
from F 0 on. 

The thermionic current in this example will be evaluated from 
eq. (30) on substitution of the appropriate value of D. Thus we seek 
(placing E a - Fj) 


D(u) 




D(ukT + Fi)« - “ du. 


(45) 


This can at once be split up as follows 


/ ir c -ri)/kT 

D 2 e~ u du -f- / D 1 e~ u du (46) 

‘'(.Yt- Vi)/kT 

with 

WukT/(ukT + FQ 

1 = r IVt-Vo+ukT I ukT I 2 Vo(Vi-Vo)yx 2 k 2 l 

L> ukT+Vi ^V^Vo+ukT-i (ukT + Vi)(ukT+ Fj - V 0 ) 

( 47 ) 
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while 


Z> a = 


4 

VoiYo-Vt) 


1 


1+cosh 2lVV 0 -Vi-ukT-VSic a m/h a 
2(V 0 -Vi-ukT) 


. (wkr+Fj) r 

J » kT -- 

/Fo-Fj-ttkrl 

Fo(Fo-FO L 

v Fo-F,-«kr 

* wifer+Fx -I 


(48) 


Now in the evaluation of the second integral in (46) u > (F 0 — Vi)/kT 
or Vi + ukT > F 0 . Hence for most of the way from « = (F 0 — Fi) 
1/kT to oo , D\ will approximate unity. Therefore the second integral 
becomes of the order of e~ <r °" r ‘ >/ir Keeping this in mind we 
examine the first integral. By a slight rearrangement and expansion 
of the square bracket in the denominator of (48) we have 


16 VukfVukT + F,(F 0 - V l - ukT ) 
2 " Wo ~ Vt) D 


(49) 


where 

D _ 2 -j- Vx-u*T- _J_ e -Zl^r tt -Vi-ukT 

±ukT(ukT + F x ) + 2(F 0 - Vi - ukT) 2 

-&VukfVukT+ Vi(V 0 - Fi - ukT) 
Vo(V 0 - Vt) 


Evidently the dominant term in D in the range of u involved in the 
first integral is the second. If we neglect the rest we obtain 

(Vj-vo/tr 

Dae~ u du 

16 riri-vo/*T 

' Jl , ' VM( ‘ ir+l ' I | 

• (F 0 -Vi- u kDe~ 2lVyt ~ Vl ~ ukT ^ v,i,rim/h, du. (50) 



After some expansions based on the smallness of ukT/V i, etc., we ' 
finally arrive at the approximate result that the above integral is 

^ • VVikT-iV o - F 1 )e _2 ‘ VFa " Fl ' V8 ' ,m/Ai . 

M 


( 51 ) 



Sec, 4] 


EFFECT OF STRONG FIELDS ON EMISSION 


289 


This leads to 

D(u) = .-*'*=*" + a term of order 

(S2) 


Now for much of the thermionic range we may neglect the second term 
compared with the first. The thermionic current density can then be 
put into the form 




j = dr 2 «- wr , 

(53) 

with 




8\/ir 

A - v„ 

\/~V~kT £ -2I ' /Vn-V ’ ixmek 

1 h* 

(54) 

and 


E* — 




► 

•4i 

» 

i 

ii 

O 

(55) 


In a general way (53) agrees with experiment. For details the reader 
should consult Dushman’s article on Thermionic Emission previously 
referred to, particularly pp. 468 ff. 


4. EFFECT OF STRONG FIELDS ON EMISSION 

In our discussion of thermionic emission we have neglected the 
effect of the electric field actually used to get the electrons away from 
the surface, assuming with apparent success that its influence is very 
small. However, as the external field strength is increased it becomes a 
significant factor. Suppose the intensity of the applied field is F 
(assumed uniform), corresponding to a negative potential energy 
—eFx. Taking the boundary at * = 0, as usual, and including the 
image force potential energy (already mentioned at the end of Sec. 2), 
the total potential energy in the neighborhood of the boundary may 
be written in the form 

e 2 

F = F° - — - eFx. (56) 

Here Vo is the asymptotic height of the potential wall outside the 
metal in the absence of an external field. 2 The effect of the external 
field is to lower the potential below this asymptotic level, as indicated 

* The formula (56) obviously cannot apply right at the boundary, * ~ 0. We 
•hall assume that it holds to within 10A or so of the boundary. 
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in Fig. 1 1 • 5. The first effect is to reduce the effective jump in potential 
across the boundary. The maximum value of V in (56) occurs at x\ = 
y/e/^F and is equal to « V 0 - VF. If we could assume that 

v-v> 



the principal effect of the field really is to lower the effective V by the 
amount e w VF, i e., neglect the portion of the V curve beyond x lt the 
sole correction to the emission formula (33) would be multiplication 
by the factor Thus (33) would become 

J = 4 ^f r2 (57) 

corresponding to increased transmission with increasing field intensity. 
The experiments of de Bruyne * agree rather well with (57). 

As the value of F is increased to about 10® volts/cm, the dip of the 


r-i? 



potential curve near the maximum becomes so great that the above 
approximation no longer holds. The situation is approximately 
depicted in Fig. 11 • 6 in which the effect of the image force, leading to 

•Proc. Roy. Soc. 120, 423 (1928). 
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the rounded off dotted curve, may be neglected and for the sake of 
simplicity the potential course may be represented by the heavy curve. 
This is intended to approximate the situation at absolute zero. To 
the left of the barrier the Schrbdinger equation for the x dependent part 
of the function is 


dV , 

dx 2 + 



0, 


(58) 


and to the right of the barrier 


d 2 \p 8ir 2 m 

d? + ~w 


C E x - Vo + eFx)i = 0. 


(59) 


The complete solution of the transmission problem has been given by 
Fowler and Nordheim 4 and we shall merely quote the result for the 
transmission coefficient (Note: E x < Vo), which is 


D{E X ) = 


WE X (V 0 -E X ) 

Vo 


-H V8,«m/A*(Vo-Jr*) 44 ]/8J'’ 
€ • 


(60) 


By referring to eqs. (28, 29, and 30) we can now write for the electron 
current density 


J = 


4trmekT 


Td{e x ) r - 

Jo Jo < 


di\ 




+ 1 


dE x 


(61) 


where our previous formulas are modified to a certain extent since the 
energy of the electrons cannot exceed the maximum energy <f> = yikT, 
corresponding to low temperatures. In spite of this limitation (61) 
indicates that a non-vanishing current exists. It should be pointed 
out that in (61) ij = m(t£ + i%)/2kT. The evaluation of J depends 
first on the integral 


f 

Jo 1 


di) 


.*+( 


+ 1 


Since E x < <t>, (E x — 4>)/kT is large negatively. Consequently we may 
evaluate the integral from (114) of Chapter VIII using p = 0. It will 
be sufficient to take only the first term in the result (eq. 122 of Chapter 
VIII) and write 

£ S + iE'-ZkT + j ~ i~kT^ )' (62) 


4 Proc. Roy. Soc 119, 173 (1928). 
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Substitution into (61) with the use of (60) yields 

J = (^ - £ *) e_t4 ' V/i ^ (7 °~ Jr * )Wl/3 ^-- («) 

To evaluate the integral, let $ — £» = £ a nd expan d (V 0 — <t> + t) H in 
powers of {. For convenience represent \ / '%tr 2 m/h 2 by n- The integral 
then becomes 

e -4„(7.-*)*/3f ■ AV(« - &(V 0 ~<t> + 

Jo 

We expand the radical in powers of £. The resulting integration is 
simple and the final result becomes 

The experimentally observed emission agrees in general with (64) 
being indeed of the form 

J = KF 2 e~ a/p , (65) 

where K and a are constants. The exponential dependence on F is 
particularly well substantiated by experiment.® 

5. THE PHOTOELECTRIC EFFECT 

An electron emission problem closely allied to thermionic emission 
is the photoelectric effect. When light falls on a cold metal surface, 
electrons are emitted if the frequency of the light exceeds a certain 
threshold frequency vq. We can understand this situation in terms of 
the preceding sections of this chapter, since when the metal is at a 
temperature near absolute zero the maximum kinetic energy of the 
electrons is about equal to <t> (Sec. 1). Consequently in order that an 
electron with energy E less than <t> shall get over the barrier E 0 ■= F 0 
it must get from the incident light quantum the energy hv such that 

hv + E = E t . (66) 

The least frequency which will satisfy this condition is therefore y 0 , 
where Av 0 + <f> = E e or 

v 0 = (E c - <t>)/h. (67) 

* See Fowler, " Statistical Mechanics," second edition, p. 357 and accompanying 
references. 
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The photoelectric threshold thus proves to be vitally related to the 
thermionic work function. This relation is well substantiated by the 
experimental values for many metals as the accompanying table shows. 
In considering the comparison we must, of course, recall that the 
threshold is not usually measured at T = 0 and there is bound to be 
some temperature effect. 

Table op Thermionic Work Functions and Photoelectric Thresholds 
(From Fowler, “ Statistical Mechanics ") 

The values are in electron volts 


Metal 

Cs 

Ta 

Mo 

W 

Ni 

Pd 

hv o 

1 9 

4 11 

4 IS 

4 54 

5 01 

4 97 

(E e - 4) 

1 81 

4 12 

4 15 

4 54 

5 03 

4 99 


We shall give an elementary analysis of the emission based on the 
assumption that the photoelectric current density per unit light 
intensity is simply proportional to the number of electrons incident per 
second normally on the surface of the metal for which 


E x + hv > E c , 


where E x is the kinetic energy associated with the direction normal to 
the metal surface taken as the x axis. Now the number of electrons 
with x component velocity lying between v x and v x + dv x striking unit 
area of surface per second is (from Sec. 1, eq. 1) 


dJ = 


2m 3 e 

“ft 3 " 



dv v dv t 

l + e -yi+mT' 


( 68 ) 


There is a certain advantage in transforming from v x , v v , v, to cylin- 
drical coordinates v x , p and 0, where 

p 2 = + dv v dv t = pdpd<t>. (69) 

Then (68) becomes 

4irm 3 e /*" pdp 

& - h 3 VxdVx J Q j _|_ e -7i+«»i/2tr+mp»/2*r‘ 


The integration yields 

ivtn 2 ekT 
dJ = rs — 


v x log (1 + dVi ' 


(71) 
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The photoelectric current density per unit light intensity is then 

1~K- log (1 + e l +-*^ /kT ) dE s , (72) 

, tr : s a constant proportionality factor. Introduce the trans- 
SSJL’ V eall The current 

density takes the form 


I = 


K i«mek 2 T 2 log (1 + «>) ^ 

Jo w 


h 3 


(73) 


The integral is a well-known one. 
i.e., (a) v £ v 0 for which w 0 £ 1, 
, hv o - hv 

For (a) we have, setting -jy 


We distinguish between two cases, 
and (6) v > v 0 , for which w 0 > 1 . 

= «, 


1 = 


K- 


l*mek 2 T 2 

h 3 




(74) 


For (b) we write 

piog£+«O da; 

Jo w 


■jf 


1 log (1 + w) 


dw + 


a> 


/ 


'•logd+w) 


- dw 


w 




whence for v > "o 


+ + (75) 

We thus see that the photoelectric current density I is given in the 


general form 


y 2 = 4f(«) 


(76) 


where A is the constant KA*mek 2 /h 3 and /(«) is a function to be 
evaluated from the previous equations (74) and (75). We note at once 
the interesting fact that if v = 0, so that a = hv 0 /kT = (E„ - 4>)/kT, 
the equation (76) reduces to a close approximation to 

I = AT 2 e-^-* )/kT , 0D 


which is just the thermionic emission equation (6) with the exception 
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of the multiplicative factor K. The higher terms in the expansion 
(74) i.e., e~ 2a , e~ Sa , etc., are all negligible compared with e~ a in this 
case. 

Following DuBridge 6 we examine the form of eq. (76) when the 
surface of the metal is at T = 0. Here the argument of the function 
/ is + oo or — oo according as v < i> 0 or v > v a . Consequently we get 
the alternative results 


(•Or— o = 0 for v ^ x 0 


(7)r-o = 


K ■ 2vme 
~h 3 


(hv — hv o) 2 for v ^ vq. 


(78) 


If then we plot the photoelectric current density as a function of fre- 
quency, we find no emission out to the threshold frequency * 0 ; there- 
after the emission is a quadratic function of the frequency. The curve 
is indeed a parabola with vertex at v = vo as indicated in the accom- 
panying figure (Fig. 11-7). For T > 0, the plot of (76) using (74) 



and (75) yields curves of somewhat similar shape which, however, 
do not touch the axis at precisely v 0 . It is clear that the threshold 
frequency has a precise meaning only at absolute zero. At higher 
temperatures there is no absolutely sharp threshold, as the current- 
frequency curves approach the axis more gradually. Nevertheless it 
is convenient to continue to look upon v 0 = (E 0 — <f>)/h as the thresh- 
old, as it actually would be if we could lower the temperature of the 
surface to 0° K without altering E c and <t>. We are justified in looking 
upon vo as an important characteristic of the surface. 

Let us go back to eq. (76) and again setting (hv o — hv)/kT = a, 

®L. A. DuBridge, “New Theories of the Photoelectric Effect,” Actuality* 
Scientifique, 268, Paris, 1935. 
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take the logarithm of both sides (base 10 is usually the more conveni- 
ent), Thus 

log io(I/T 2 ) = £ + *(«) (79) 

where B = \og 10 A (a constant independent of v and T) and $ (a) = 
logiof(a) . If log 10 (I/r 2 ) is plotted against a, the resulting curve will be 
independent of the metal and the temperature except for an additive 
constant. In^ other words the curve should be superposable on the 
curve obtained by plotting $(a) against a by a shift along the ordinate 
axis of amount B . The form of 4> (a) is shown approximately in Fig. 



11*8- Unfortunately « is known for given frequency only when v 0 
is known and this depends on the metal. Fowler therefore suggested 
the plotting of the experimental values of log ( I/T 2 ) against hv/kT. 
This curve should agree with the theoretical curve (79) by a backward 
shift along the axis of amount hv 0 /kT and a vertical shift of B, By 
ascertaining the horizontal shift, Fowler was able to obtain theoret- 
ically good values of vo- For more complete discussion of the agree- 
ment with experiment DuBridge’s paper should be consulted. 

We next consider the energy distribution of the emitted photo- 
electrons and examine the case of those electrons which emerge nor- 
mally to the metallic surface. Experimentally the distribution is 
measured by applying a retarding potential between the emitting 
surface and the collecting electrode and then measuring the photo- 
electric current as a function of this potential. The number of photo- 
electrons emerging per second normally from the surface against a 
retarding potential V may be obtained by a slight modification of eq 
(72). Thus we divide by the charge e and in the lower limit of the 
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integral employ E a + Ve in place of E e . The number in question as a 
function of V becomes 

N(V) = — 4 ^” fe - r r log (1 + c ( *~ Xm)/kT ) dE x . (80) 

n JE.+ 7t-k, 

If we call the integrand n(E), i.e., 

»(£.) - log (1 + ,»-«/«), # 

it follows that 

n{E e -h,+ Ve) = K - ~ - r log (1 + e -r./*r.^-,)/*r )( (gl) 

and this is the number of photoelectrons emitted per second by light of 
frequency v per unit energy interval with the energy E — Ve — 
h(v — v 0 ) + 4> *= Ve + Ee — hv. It is called the “normal” energy 
distribution function. At T = 0, it reduces to 

n 0 (E c - hv + Ve) = K [h(y - v 0 ) - Ve]. (82) 

When plotted as a function of Ve — h{v —v Q ), no is a straight line as 
in Fig. 1 1 • 9, intersecting the axis of abscissas at h{v — y 0 ) = Fe. When 
T ?*0, the curve tails off as shown by the dotted line in Fig. 11-9. The 
general agreement with experiment is not too good, probably because 



of the neglect to take account of a transmission coefficient as we did in 
the treatment of thermionic emission. For a complete analysis of the 
experimental data, consult DuBridge. The resul t is that in the vicinity 
of the energy h(v — v 0 )> the experiments check the theory rather well. 

Further investigation of the photoelectric effect would take us too 
far afield. The reader who is interested in the total energy distribu* 
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tion of the photoelectrons, i.e., the distribution taking account of all 
the velocity components, will find a thorough discussion in DuBridge’s 
article. 

6. CONTACT POTENTIAL 

In the immediate neighborhood of the surface of a metal in a 
vacuum there will be at every temperature an atmosphere of electrons 
in equilibrium with the metal. The electrostatic potential of the 
field produced»by this electron gas will be equal to that at the surface 
of the metal; but the latter is a function of the metal. Consequently 
we expect that when two different metals are very close together in a 



vacuum, a potential difference will be set up between them. This is 
the so-called contact potential. We can obtain an interesting relation 
between the contact potential and the thermionic work functions for 
the metals by the following simple considerations. 

In Fig. 11 10 we represent schematically the two metals 1 and 2 
separated by a vacuum. The temperature is kept constant at the value 
T . The average number of free electrons per unit volume of metal 1 at 
the surface is given by eq. (171) of Chapter VIII 

**-; (*.+$)“ 

where Vi is the electrical potential just inside the surface and —eV l is 
the potential energy of the electrons at this place. (We are here, of 
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course, treating e as a positive quantity.) Similarly in metal 2 the 
average density of free electrons is 

<84) 

But from our study of thermionic emission we know that 

(85) 


(86) 

where fa and fa are the maximum kinetic energies of the electrons in 
the two metals respectively. In Sec. 1 of this chapter we used <t> — yikT 
to denote the maximum kinetic energy of the free electrons of the 
metal. However if the electrons are in a force field and possess poten- 
tial energy also, an extension of the analysis of Sec. 7 of Chapter VIII 
shows that it is y\kT + eVi which corresponds to the maximum 
kinetic energy. Hence (85) and (86) are justified, fa and fa are 
sometimes referred to as the inner work functions. From (85) and 
(86), we have 

c(Fi - F 2 ) = (fa - fa). (87) 

Now Vi — V 2 can be written in the form 

Vi - F 2 = F x — F a + F 8 — F 4 + F 4 - F 3i (88) 

where V 3 is the electrical potential in the vacuum at the point 3 just 
outside the metal 1, while F 4 is the potential in the vacuum at the 
point 4 just outside the metal 2. The quantity — e(V 3 — Fj) is the 
minimum kinetic energy for those electrons in 1 which are able to cross 
the surface. If we use the notation of Sec. 1 we should call this (£ e ) 1 . 
Similarly 

~e(V A - F 2 ) = (E e ) 2 . 

Consequently 

fa — fa = (E e ) i — (E e ) a + (F s — F 4 )tf 
or slightly rewritten 

e(V 3 — Vi) = [(£ c )a — ^ 2 ] — L(-Ee)i ~ $i]» (89) 

Now (E c )i — fa is the thermionic work function for the first metal and 
(E c ) a — fa that for the second, while (Fs — F<) is the difference in 


and 


Tl “l 1 'T' 


fa 

kT 


eF 2 _ fa 
yi+ kT ~ kT 
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electrical potential between two points just outside each metallic 
surface respectively. It is by definition the contact potential. The 
relation (89) connects the contact potential between two metals with 
their thermionic work functions. It has been verified experimentally 
for a number of cases, though exact measurements of contact potentials 
are difficult to carry out. For a discussion of the experimental agree- 
ment Fowler’s "Statistical Mechanics,” p. 364, may be consulted. 


PROBLEMS 


1. Evaluate the thermionic current density for molybdenum at T ■ 1,000° K, 
1,500° K, and 2,000° K using both forms of the Richardson equation, x.e., (5) and (6). 
Take E e — 4> = 4 08 electron volts. Compare the results with those obtained using 
the experimental data listed in S. Dushman, Rev. Modem Phys , 2, 381, 1930, p. 394. 

2. Discuss the transmission of electrons through a boundary barrier in which the 
potential rises from V = 0 to V = Vo continuously and very slowly with a con- 
tinuous gradient. Show that in this case there is complete transmission for all 
energies. (Cf. the elastic wave analogy in R. B. Lindsay, J. Acous. Soc. Am , 
12, 378 11941] ) 

3. Solve the transmission problem of Sec. 2 for E < Vo and show that eq. (27) 
results. 

_ 4 . Find the value of E c which makes the effective average transmission coefficient 
£>(«) - 0.50 for r - 1,000° K 

5. An image force potential barrier may be represented by the equations 
V = - Vo , *50 


V 


4* + e 2 / Vg ' 


x 5: 0 


where * => 0 represents the boundary. Find the expression for the average transmis- 
sion coefficient in this case. 

6. Fora monatomic layer of thorium on tungsten assume that Vo = 10.3 electron 
volts and Vi ** 8 4 electron volts. Take l = 2.85 X 10 -8 cm. Compute the 
effective average transmission coefficient. 

7. Plot the photoelectric current density per unit light intensity (divided by the 
constant K ; cf. Sec. 5) for the metals tungsten and nickel at T ■ 500° K and 
1,000“ K respectively. 

8 . Plot the normal energy distribution function of the photoelectrons from 
potassium at 20° C, i.e., eq. (81). 
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Thermodynamics, 41 
first law of, 62 
second law of, 62 

statistical analogies to, 59, 68, 139, 
146, 148, 173 
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